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1. Introduction 



Computing integrals is a basic need in many areas of mathematics and 
applied sciences. It is a common experience that some integrals can 
be calculated "by hand" , obtaining a "closed formula" , while others 
cannot be reduced to a simple expression. This fact is at the origin of 
numerous approximation techniques for integrals, such as quadratures, 
series expansions, Monte-Carlo methods, etc. The purpose of these 
notes is to start what seems to be a new line of investigations in 
studying the behavior of integrals of the form 



for sets A far away from the origin. In this integral, dx is the Lebesgue 



Such integral is in general very difficult to compute numerically. 
Indeed, in interesting applications d is between 10 and 50 say, and 
quadrature methods are essentially out of considerations. On the 
other hand, as A is far away from the origin, the integral is small 
— the interesting range for some applications is when the integral is 
of order IQ-^, lO'^, or even smaller — and standard Monte-Carlo 
methods fail to provide a good evaluation at a cheap cost. 

The motivation for this study comes mainly from applications in 
statistics and probability theory. However, the techniques developed 
should be useful in other areas of mathematics as well, wherever such 
integrals arise. 

Before going further, let us show some consequences of our main 
approximation result in probability and statistics. We hope that the 
reader will find in these examples good motivation to continue reading. 

Consider the following two density functions on M , 





(1.1) 



measure on W^, and the function / is integrable over W^. 
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where 

'a + 1 



r 



I^-iii rv — cUld K o fy — — . 



a 



so Wa and integrate to 1 on the real line. The density W2 is the 
standard normal distribution. The density Sa is that of a Student 
distribution. These two functions are very different. The symmetric 
Weibull-like, Wa, decays exponentially at infinity, while the Student 
one decays polynomially. 

Let us now consider a real d x d matrix C — {^i,j)i<i j<d with at 
least one positive term on its diagonal. We write (■ , ■) the standard 
inner product on W^, and consider the domain 

At = {xeM.'^ : {Cx,x) ^t}. 

As t tends to infinity, it is easy to see that the set At — \fi,A\ pulls 
away from the origin since A\ does not contain 0. We will explain 
the following in section 8.1. If a 7^ 2, then there exists a function — 
rather explicit, and in any case computable — c(-) of a, d and C, such 
that 

/ n "^a(^Odxi • • • ~ c(a, d, C)e-*"^'<=H-("-2)i-f , 

as t tends to infinity. When a = 2, the asymptotic behavior of the 
integral is different. Namely, if k is the dimension of the eigensubspace 
associated to the largest eigenvalue A of + C, 



as t tends to infniil v- Comparing the forniTila for a equal or different 
from 2, we see that the power of namely [a — 2)d/A — 01/2 for a 
different from 2 and {k — l)/2 for a equal 2, has a discontinuity in 
a = 2, whenever k is different than 2. Moreover, we will see in section 
8.2 that 
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as t tends to infinity. These estimates give approximations for the 
tail probabihty of the quadratic form {CX, X) evaluated at a random 
vector X having independent components distributed according to Wa 
or So;. We will also see what happens when all the diagonal elements 
of C are negative. 

Consider now d = n? for some positive integer n, and write M(n, M) 
for the set of all real n x n matrices. Let 

At = {xe M(n,M) : deta; ^ t} 

be the set of all n x n real matrices with determinant at least t. 
The set Ai is closed and does not contains the origin. One sees that 
At = t^/^Ai moves away from the origin when t increases. We will 
prove that, as t tends to infinity, 



ce 



-t^ ln^(a(p?-\)-2v?)l2n [f ^ ^ 2 



while 



This gives a tail estimate of the distribution of the determinant of a 
random matrix with independent and identically distributed entries 
from a distribution Wa or Sq. The constant c depends on or Wa as 
well as on the dimension n; but it does not depend on t and will be 
explicit. 

Possible applications are numerous. We will deal with further 
interesting examples, such as norms of random matrices, suprema of 
linear processes, and other related quantities. 

Though some specific examples could be derived with other meth- 
ods — in particular those dealing with the Gaussian density integrated 
over rather simple sets — we hope that the reader will enjoy having a 
unified framework to handle all those asymptotic problems. It has the 
great advantage of providing a systematic approach. It breaks seem- 
ingly complicated problems into much more manageable ones. It also 
brings a much better understanding of how the leading terms come 
out of the integrals. Through numerical computations we will also see 
that our approximations are accurate enough to be of practical use in 
statistics, when dealing with some small sample problems — the type 
of problem for which there are no systematic tools as far as the author 
knows, and which originally motivated this work. 
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Another purpose of these notes is to investigate asymptotics for 
conditional distributions, and Gibbs conditioning. For instance, 
consider n points in M", whose coordinates are independent and 
identically distributed random variables. They form a parallelogram. 
Given, say, the volume of this parallelogram, these points are no longer 
independent. What is their distribution? In general, this seems to be 
a very difficult question, except if one stays at a very theoretical level 
and does not say much. We will show that it is possible to obtain 
some good approximation of this conditional distribution for large 
volumes of the parallelogram — mathematically, as the volume tends 
to infinity. 

It is important to realize that conditional distributions are rather 
delicate objects. To convince the reader of the truth of this assertion, 
let us consider four elementary examples where everj^hing can be 
calculated explicitly and easily. 

Example 1. Let X, Y be two independent standard normal random 
variables. We are looking for the distribution of (X, Y) given X + Y ^ 
t, for large values of t. A simple probabilistic argument to obtain 
a limiting distribution is as follows. Let U = {X + Y)/V2 and 
V = {X - Y)/V2. The distribution of X given X + Y ^ tis that 
of {U + V)/V^ given U ^ t/\/2. One can check by hand that the 
conditional distribution of U/t given converges weakly* 

to a point mass at l/\/2. Since U and V are independent standard 
normal, and since {X, Y) is exchangeable conditionally on X + Y, the 
pair {X,Y)/t converges in probability to (1/2,1/2) conditioned on 
X + Y^t. 

Consequently, if we are given X + Y ^ t for large t, we should 
expect to observe X/t « Y/t ^ 1/2. Hence, X and Y are both large, 
and about the same order. 

Example 2. Consider the same problem as in example 1, but with 
X and Y independent and both exponential. For any a € (0, 1), 



P{X ^at;X + Y ^t} = / e'"' e'^ dx dy = e'^dx 



P{X ^ at\X + Y^ t} = 



P{X ^at;X + Y ^t} 
P{X + Y^t} 



We can explicitly calculate 





= ate 



-t 
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On the other hand, X + Y has a gamma distribution with mean 2. 
Integration by parts yields 

POO 

P{X + Y;^t} = J are-Mx = (t + l)e-* . 
It follows that 

P{X ^at\X + Y ^t} = a ^ 



t + 1 



Thus, given X + Y ^ t, the distribution of X/t converges to a uniform 
distribution over [0,1]. Again, by conditional exchangeability, the 
same result holds for Y/ 1. In conclusion, the distribution of {X, Y)/t 
conditioned onX+Y ^ t converges weakly* to a uniform distribution 
over the segment a(l,0) + (1 — Q;)(0, 1) in M?. So, we do not have 
the kind of degeneracy that occurs in Example 1. Conditioned on 
X + Y ^ t, the random variables X and Y should still be of the same 
order of magnitude, t, but not necessarily close to each others after 
the rescaling by 1/t. 

Example 3. Consider the same problem, with now X and Y inde- 
pendent, both having a Cauchy distribution. Elementary calculation 
shows that 



f°° dx f°° dx 1 
P{X^s}= _-_~/ _ 

Jg 7r(l + X'^) Jg -KX^ TTS 



as s ^ OO . 



Moreover, the stability of the Cauchy distribution — see, e.g., Feller, 
1971, §11.4 — asserts that {X + Y)/2 has also a Cauchy distribution. 
Consequently, 

lim P{ X ^ ei , y ^ et I X + y ^ t } 

t— +00 

P{X^et;Y^et} 
P{X + Y^t} 

This proves that we cannot have both X and Y too large as t tends 
to infinity. Moreover, for s positive. 



P{X + Y^t} 2(1 + s) 

as t tends to infinity. Consequently, given X + Y ^ t, the random 
vector {X,Y)/t has a distribution which converges weakly* to {Px + 
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Py)/2 where Px (resp. Py) is a Pareto distribution on the x-axis (resp. 
y-axis) . 

These three examples show that we can obtain very different behav- 
iors, with various degrees of degeneracy for the hmiting conditional 
distributions. But they also show examples of nonconceptual proofs. 
They provide no insight on what is happening. All the arguments 
are rather ad hoc. However, it is intuitively clear that the limiting 
conditional distribution has to do with the tail of the distribution of 
(X, y). Specifically, if we write f{x,y) for the density of {X,Y) and 
Af = {{x,y) : X + y ^ t}, we are led to consider expressions of the 
form 



When looking at J^^ f{x, y) dx dj/, examples 1, 2, 3 have very different 
features. Intuitively, the behavior of the integral has to do with how 
the set At lies in compared to the level sets of f{x,y). Examples 
1, 2 and 3 correspond to the following pictures. 





Example 1 



Example 2 
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Example 3 



In the three pictures, we see the gray shaded set At, its boundary, 
and the level sets 



for c = 1, 2, 3, . . . The fact that the sets Ac are getting closer and closer 
as c tends to infinity in example 1 expresses the very fast decay of the 
Gaussian distribution. At the opposite extreme, in example 3, the 
level sets are further and further apart because the distribution has a 
subexponential decay. These pictures show why in examples 1 and 3 
we obtained some form of degeneracy in the limiting distribution. In 
example 1, the density function is maximal on the boundary of At at 
one point whose two coordinates are equal. In example 3, the density 
is maximal at two points on the boundary of At. These two points 
have one coordinate almost null, while the other coordinate is of order 
t. But one should be careful, as rough pictures may not give the right 
result if they are read naively in more complicated situations. Our 
fourth example is still elementary and illustrates this point. 

Example 4. Consider (X, Y) where X and Y are independent, both 
having a Cauchy distribution with density si . We are now interested 
in the tail distribution of XY and in the distribution of {X, Y) given 
XY ^ t for large t. By symmetry, it is enough to consider the same 
distribution but adding the conditioning X ^ and Y ^ 0. 



OnX,Y > 0, set U = logX and V = logF. The density of U is 



Ac = {{x,y) :f{x,y) = e 



e' 



,u 



e 



—u 



fu{u) = 



e"si(e") = 



7r(l + e2") 



as u —>■ oo . 



TT 



8 



Chapter 1. Introduction 



Thus, the level sets of the density of (U, V) far away from the origin 
look like those of the exponential distribution of example 2. Hence, 
when studying the distribution of U given U + V ^ logt — i.e., 
X given XY ^ t — we could expect to have a behavior similar to 
example 2. However, the picture looks as follows. 



In particular, write At = {{x,y) : xy ^ t} for the set on which 
we integrate the density of {X, Y) . The boundary dAt has a unique 
contact point with the maximal level set of the density intersecting 
At, where by maximal level set we mean Ac such that Ac+e Ci At is 
nonempty if e is positive, and is empty if e is negative. In that aspect, 
we are close to examples 1 or 3. This example 4 will show us two 
more things. First we need to take into consideration the decay of 
the density — which explains why we will not have degeneracy as in 
examples 1 or 3. Second, it shows that whichever resTilt we can prove 
will have a lot to do with the shape of the domain considered and how 
this domain pulls away from the level sets of the density. 

To see what happens in this fourth example is still rather easy. We 
first calculate 




Example 4 





(1.2) 



It is true that 




as u ^ oo . 
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However, replacing (7r/2) — arctan(t/x) by x/t in (1.2) leads to a 
divergent integral. So, we need to be careful. Let 77 be a positive real 
number. There exists a positive e such that for any u> 1/e, 



TT 

— — arctan u e 



Consequently, 

'■et da; 



1 — ry 1 + ry 



u 



r t\ 

I — — arctan — — 



f / ^ r* Xdx 1 + 77 , , 



< 



Moreover, 

^ / TT — arctan - ^ — / — :r = -— . 

Therefore, using log(l + e^t^) ~ 2 logt as t tends to infinity, we have 

TT^ t 

Next, for a G (0, 1), the same argument shows that 

1 f°° dy dx 



p{XY^t-x^e} = \ r I 

Jf- Jt, 



1 + 1 + 



t/x 

I f°° (TT t\ dx 

-t: — — arctan - 



TT^ J fa V 2 X/ 1 + X^ 

1 X dx 
7r2 J^c ll + x^ 

^(logei-logr + o(l)) 
logi,^ . 

— — a) as t — >■ 00. 



Consequently, we obtain 
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It follows that the distribution of {X, Y)/t given XY ^ t converges 
weakly* to 5(o,o)j showing a degeneracy like in example 1, but of a 
different nature. A linear normalization is not suitable, so we must 
use a logarithmic scale. Writing X = eie^ and Y = 626^ with ei 
and 62 taking values +1 or —1 with probability 1/2, we proved that 
the distribution of {U,V)/ logt given XY ^ t converges weakly* to 
a uniform distribution over { (s, 1 — s) : s G [0, 1] }, which is very 
similar to example 2. 

Another interesting feature about examples 1-4 is that they use 
some rather specific arguments in each case, since everything could be 
calculated quite explicitly. When looking at more complicated distri- 
butions, or at more complicated conditioning, or in higher dimensions, 
we cannot rely so much on elementary intuition. One of the main 
goals of the present work is to give a systematic procedure for doing 
the calculation. The key point is that we will be able to transform 
the problem into one in asymptotic differential geometry, i.e., a con- 
junction of asymptotic analysis and differential geometry. In practice, 
the approximation of the conditional distribution will boil down to a 
good understanding of the contact between dAf and the level sets of 
the density. Having to deal with purely geometrical quantities will be 
helpful because differential geometric arguments will not depend on 
the dimension, and also since differential geometric quantities, such 
as curvatures, can be computed from different formulas, using very 
different parameterizations. Whichever parameterization is the most 
convenient will be used. In some sense, this is very much like changing 
variables in integrals, and we hope to convince the reader that once it 
has been learned, it considerably simplifies the analysis. The disad- 
vantage is that it requires investing some time in learning the basics 
of differential geometry and asymptotic analysis — but can we really 
hope to solve every problem with elementary calculus? 

At this stage, I urge the reader to look at the conditional distri- 
bution of the parallelogram given its volume mentioned in the intro- 
duction. In terms of linear algebra, the question amounts to looking 
at the conditional distribution of a random matrix with independent 
and identically distributed coefficients, given that its determinant is 
large. For 2x2 matrices, we are already in R^, since we have 4 coeffi- 
cients. The condition that the determinant is larger than t determines 
a subset whose boundary is of dimension 3, and it is impossible to visu- 
alize it. Such a simple example already shows the need for systematic 
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methods relying as little as possible on intuition. 

Let us now say a few words on how our results are connected to the 
classical Laplace method. They can be viewed as a Laplace method 
with an infinite dimensional parameter. 

In order to discuss this point, and because some understanding of 
Laplace's method may be helpful in reading the next pages, let us 
state and prove the following elementary result. Writing I = — log /, 
consider an integral of the form e~^^^^Mx. 

LI. THEOREM. (Laplace's approximation) Let A be a compact 
set in and I be a twice differentiable function on M*^. Assume that 
I has a minimum on A at a unique interior point a*, and that D^/(a*) 
is definite. Then, 



Proof. Let e be a number between and 1/2. Since a^, is in 
the interior of A and / is twice differentiable, we can find an open 
neighborhood U of a* on which 



Moreover, there exists a positive r] such that I{x) ^ /(a*) + on 
AnU^. Consequently, 




as A oo . 



(D^/(a*)(a; — a*) , x — a*) ^ I{x) — /(a*) 

^ -^—{D^I{a){x - a*), X - a*) . 





^g-A(7(a,)+'?)|^n[/c 



The change of variable h = ^/X{^--€j{x — a*) yields to the upper 
bound 




^g-A(/(a*)+,,)|^p^c 
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Since a^, is an interior point of A, the set \/X{l — e){A f] U — a*) 
expands to fill M*^ as A tends to infinity. Therefore, 



J A (detD2/(a.)) ^ 



for A large enough. A similar argument leads to the lower bound where 
the term (1 — 2e)~'^/^ is replaced by (1 + 2e)~'^/^. Since e is arbitrary, 
the result follows. ■ 

How is our integral J^e^^^^^dx related to Laplace's method? If 
A = tA\ for some fixed set Ai , and if / is a homogeneous function of 
degree a, we see that 

/ e-^(^)dx = t'^ / e-'^'y^dy = t'' [ e'^^^^^^dy. (1.3) 
JtAi Jai Jai 

On one hand Laplace's method is directly applicable provided / has 
a unique minimum on Ai. On the other hand, the integral on the left 
hand side of (1.3) is like (1.1). Having a good estimate for (1.1) will 
yield an estimate of (1.3). 

When studying (1.1), one can try to argue as in Laplace's method. 
Writing I{A) for the infimum of / over A, we obtain 

/ e-'^-Ux = e-^(^) [ e-(^(-)-^(«*))da; . 
Ja J a 

We can hope to have a quadratic approximation for I{x) — I{a^). 
But if we consider an arbitrary set far away from the origin, there 
is no reason for / to have a minimum at a unique point on A, and 
there is even less reason to have this infimum in the interior of A. In 
general attains its minimum on a set Va depending on A, which can 
be very messy. Roughly, we will limit ourselves to situations where Va 
is a smooth /c-dimensional manifold, which still allows some room for 
wiggling curves and other rather nasty behaviors. In essence, thinking 
of A as parametrized by its boundary dA, our method consists in 
applying Laplace's approximation at every point of T>a along fibers 
orthogonal to Va, and integrating these approximations over dA, 
keeping only the leading terms. The difficulty is to obtain a good 
change of variable formula in order to extract the leading terms, to 
keep a good control over the Jacobian of the transformation, and to 
control all the error terms. In doing that, the reader will see that 
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it amounts to a Laplace method where the parameter A is now the 
infinite dimensional quantity dA. 

Since we allow so much freedom on how the set A can be, we will 
see that not only should we look at points were / is minimized, but 
also at points a; in ^ such that I{x) — I{A) stays bounded — this is 
of course not a rigorous statement — or even is unbounded but not 
too large compared to some function of I (A). 

Approximating an integral of the form J^e^^^^^dx for arbitrary 
sets A far away from the origin and arbitrary functions / seems a very 
difficult task. For the applications that we have in mind, we will only 
concentrate on sets with smooth boundary. We will also require that / 
be convex. This last assumption may look very restrictive. However, 
by not putting too many restrictions besides smoothness on dA, one 
can often make a change of variable in order to get back to the case 
where / is convex. This is actually how we obtained the estimates 
of the integrals over s„ given at the beginning of this chapter. As 
the reader can see, — log Sa is all but a convex function. This idea of 
changing variables will be systematically illustrated in our examples. 

We now outline the content of these notes. 

Chapters 2-5 are devoted to the proof of an asymptotic equivalent 
for integrals of the form e~^^^Mx for smooth sets A far away 
from the origin and convex functions / — plus some other technical 
restrictions. The goal and culminating point is to prove Theorem 5.1. 
The main tools come from differential geometry and related formulas 
in integration. A reader with no interest in the theoretical details can 
go directly to Theorem 5.1, but will need to read a few definitions in 
chapters 2-4 in order to fully understand its statement. 

In chapter 6, we consider the special case where A is the translate 
of a fixed set. 

A second basic situation is studied in chapter 7, where A = tAi is 
obtained by scaling a fixed set Ai which does not contain the origin, 
t tends to infinity and / is homogeneous. In this case, we will overlap 
and somewhat extend the classical Laplace method. 

In chapter 8, we study the tail probability of quadratic forms of 
random vectors. If X is a random vector in and C is a dxd matrix, 
we seek an approximation of P{ {CX,X) ^ t} for large t. We focus 
on the case where X has a symmetric Weibull type distribution or a 
Student type distribution. These two cases yield different behavior 
which we believe to be representative of the so-called sub-Gaussian 
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and heavy tails distributions. They also illustrate the use of our main 
approximation result. 

The next example, about random linear forms, developed in chapter 
9, requires more geometric analysis. The problem is as follows: if 
X is a random vector on M*^ and M is a subset of W^, how does 
P{supp^]^{X,p) ^ t} decay as t tends to infinity? Again, our main 
theorem provides an answer and we will also focus on symmetric 
Weibull- and Student-like vectors. These two distributions capture 
the main features of the applications of our theoretical result to this 
situation. 

The last example, treated in chapter 10, deals with random ma- 
trices. Specifically, if X is an n x n matrix with independent and 
identically distributed coefficients, we seek for an approximation of 
P{detX ^ for large t. Again, we will deal with Weibull- and 
Student-like distributions. In the last subsection, we also approxi- 
mate P{ II AT 1 1 ^ t} for large t, the tail distribution of the norm of the 
random matrix. This last example yields some interesting geometry 
on sets of matrices, and turns to be an application of the results on 
random linear forms obtained in chapter 9. 

The last chapters address some more applied issues, ranging from 
applications in statistics to numerical computation. 

Chapter 11 presents some applications to statistical analysis of time 
series. We will mainly obtain tail distribution of the empirical covari- 
ances of autoregressive models. This turns out to be an application 
of the results of chapter 8. We will go as far as obtaining numbers 
useful for some real applications, doing numerical work. 

Chapter 12 deals with the distribution of the suprema of some 
stochastic processes. It contains some examples of pedagogical in- 
terest. It concludes with some calculations related to the supremum 
of the Brownian bridge and the supremum of an amusing process de- 
fined on the boundary of convex sets. 

There are two appendices. One deals with classical estimates on 
tail of Gaussian and Student distributions. In the other one, we prove 
a technical estimate on the exponential map. 

Every chapter ends with some notes, giving information and/or 
open problems on related material. 

When doing rather explicit calculations, we will focus on two 
specific families of distributions. Recall that the Weibull distribution 
on the positive half line has cumulative distribution function 1 — e~^" , 
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X ^ 0. Its density is ax"' ^"l[o,oo)(^)- We will consider a variant, 
namely the density 

where setting K^^^ = a^-(^/°^)/(2r(l/a;)) ensures that integrates 
to 1 on the real line. We call this distribution symmetric Weibull-like. 
This is the first specific family of distributions that wc will use. 

To introduce the second family, recall that the Student distribution 
has density proportional to (1 + a~^y'^)~^°'~^^^/^ . Its cumulative 
distribution function has tail given by the asymptotic equivalent 
— see Appendix 1 — 

Sai-x) I - S^{x) asx^oo, 

where Kg^a = /^(l + a^^y^)"'-"'''^'*^^ dy is the normalizing constant 
of the density. Accordingly, we say that a density Sa is Student like if 
the corresponding distribution function Sa{x) = Sa{y)dy satisfies 

Sa{—x) ~ 1 — Sa{x) ~ — — — as X — >■ OO , 

for some constant Kg^a- 

Why are we interested in these two specific famihes? When 
a — 2 the Weibull-like distribution is the standard Gaussian one. 
Embedding the normal distribution this way will allow us to see how 
specific the normal is. When looking at product densities, it is only 
when a = 2 that the Weibull-like distribution is invariant under 
orthogonal transforms; this will create discontinuities at a = 2 in 
some asymptotic approximations. 

The Student-like distributions are of interest because they are 
rather representative of the so-called heavy tail distributions. In 
particular, they include the symmetric stable distributions with index 
a in (0,2). But more generally, recall that a cumulative distribution 
function F is infinitely divisible if its characteristic function has the 
form 



/+00 ^tCx — 1 — sin a; , , , 
dM{x) 
-oo 



x'^ 



for some finite measure M — see, e.g., Feller (1970, §XVII). For x 
positive, define 



Jx r J-oo r 
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Recall that a function / on R is said to be regularly varying with 
index p at infinity if, for any positive A, 

lim:^ = A^ 

It can be proved — see, e.g.. Feller (1970, §XVII.4) — that whenever 

(resp. M~) is regularly varying with negative index, then 
1 — F{x) ~ M+(.t) as X tends to infinity (resp. F{x) ~ M~(x) 
as X tends to minus infinity). Thus, if 

M(— oo, —x) ~ M{x, oo) ~ 



with a > 2, then F is Student-like. More precisely, 

cia — 2) 

1 — Fix) ~ as ar ^ oo ; 

one can take the constant Kg^^ fo be c{a — 2)a~("+^)/^. 

Other distributions are Student-like. For instance, one may start 
with a random variable X having a Pareto distribution on (a, oo). 

We then symmetrize it. Take e to be a random variable independent 
of X, with 

P{e = -l} = P{e = +l} = l/2. 
Then eX has a symmetric distribution. Its tails are given by 

P{eX^x} = P{eX^-x}= ^ 



2[x-a + 1)" 

for X large enough. Thus, eX has a Student-like distribution. 

Notes 



There is an extensive literature on calculating integrals. Concerning 
the classical numerical analysis, quadratures, Monte-Carlo methods, 
etc., chapter 4 of the Numerical Recipes by Press, Teukolsky, Vetteling 

and Flannery (1986) is an excellent starting point. It states clearly 
the problems, some solutions, their advantages and drawbacks, and 
contains useful references. 
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The statistical literatTirc is full of more or less ad hoc methods to 
perform some specific integrations, sometimes related to those we are 
interested in here. In the last 20 years or so, Markov chain Monte- 
Carlo and important sampling methods have been blooming, bringing 
a large body of papers. Unfortunately, I do not know a reference 
explaining things simply. Perhaps this area is just too active and has 
not yet matured to a stage where classical textbooks become available. 

Concerning the approximation of univariate integrals, Laplace's 
method, asymptotic expansions and much more, I find Olver (1974) a 
great book. The classical references may be Murray (1984); De Bruijn 
(1958) and Bleistein and Handclsman (1975) have been republished 
by Dover. Both are inexpensive — as any book should be — and 
are worth owning. Not so well known is Combet (1982). Combet has 
proofs of the existence of asymptotic expansions derived from very 
general principles, although these expansions are not too explicit. I 
tend to believe that work in this direction would bring some practical 
results, and this is the first — loose — open problem in these notes. 

Regularly varying functions are described beautifully in Bingham, 
Goldie and Teugels (1987). 



2. The logarithmic 

estimate 



Throughout this book, we consider a nonnegative function / defined 
on M*^, convex, such that 

lim I{x) = +00 . 

|a;|^oo 

This assumption holds whenever we use a function /, and we will not 
repeat it every time. 

For any subset A of W^, we consider the infimum of / over A, 

I{A) =inf{/(x) :xeA}. 

When studying an integral of the form e~^^^Mx with A far away 
from the origin, a first natural question is to investigate if it is close 
to e~^^^\ at least in logarithmically. If this is the case, we have an 
analogue of the logarithmic estimate for exponential integrals 

log / exp ( — XI{x))dx ~ A/(^) as A ^ 00 
J A 

— is fixed here! — which holds, e.g., for sets A with smooth 
boundary. 

Our first result in this flavor will be a slightly sharper upper 
estimate than a purely logarithmic one, but for some very specific 
sets A, namely, complements of level sets of /. This estimate will be 
instrumental in the following chapters. 

Let us consider the level sets of /, 

Tc = (a; : /(x) ^ c}, c^O. 

The complement of F. in M"' is the set of all points for which / is 
strictly larger than c. Define the function 



L(c) = / e-^^'^Ux , c ^ . 
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2.1. PROPOSITION. There exists a constant Cq such that for any 
positive c, 

L(c) ^Coe-"(l + c)'^. 



Proof. Use Fubini's theorem and the change of variable u = v + c 
to obtain 



L(c) = / / {I{x)) dx e-" du = e-' [ e"'' \Tc+v \ Tc 

J J Jv^O 

Iv:: 



dv 



Since the function / is convex, nonnegative and tends to infinity with 
its argument, its graph 

{{x,I{x)) ■.xeM.'^} CM'^+^ 

is inchided in a cone with vertex the point (0, . . . , 0, — 1) and passing 
trough a bah centered at in M"^ x { }. 





Thus, I Ft I is less than the d-dimensional measure of the slice of the 
cone at height t, i.e., 

\Tt\^Ci{l + t)'^ 



^dv 



for some positive constant Ci . Thus, 

L{c)^Cie-^ [ e-^c + v + iy 

Jv>Q 
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and the result follows. ■ 

To handle more general sets, it is convenient to introduce the 
following notation. 

NOTATION. Let F , G he two functions defined on the Borel a-field 
ofM!^. We write 

(i) \im.A^oo = */ o-nd only if for any positive e, there exists c 
such that for all Borel set A of W^, the inequality I{A) ^ c implies 

\m)\^e; 

(a) liniA-^oo = ^ limA^oo G{A) = if and only if the 

following holds: 

Ve, 3d, 3c > 0, {I{A) ^ c and \F{A)\ ^ d) ^ \G{A)\ ^ e. 

Another way to phrase condition (ii) is to say that whenever I (A) 
is large enough and F{A) is small enough, then G{A) is small. 

Notice that this notion of limit depends on the function /. But if 
we restrict / to be convex, defined on and blowing up at infinity, 
then it does not depend on which such specific function we choose. 

The advantage of this notation is that it allows to express approx- 
imation properties related to sets moving away from the origin, but 
under some analytical constraints. We will mainly use (ii). We can 
similarly define liminfyi^oo F{A) and limsup^^g^ F{A). 

A first example of the use of this notation is to express a condition 
which ensures that we have a "nice" logarithmic estimate. It asserts 
that log J^e~^^^Mx is of order —I (A) provided A is not like a very 
thin layer attached to dFjf^A) ■ 

2.2. PROPOSITION. The following are equivalent: 

(i) limA^oo log / e-^(^)da; = -1, 

J A 

(ii) lime^o liminfA-^oo log \A n r(i+^)j(^)| = 0. 
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REMARK. For any set A, 

We then infer from Proposition 2.1 that 

limsup/(^)-Mog [ e-^^^'Mx ^ -1. 

Thus, statement (i) in Proposition 2.2 is really about the Hmit inferior 
of the integral as I{A) tends to infinity. 

Notice also that the the limit as e tends to in (ii) exists since e 
r(i_,_g)7(^-) is monotone. The limit is at most since l^dn r(i^.e)/(^) | ^ 
|r(i+e)/(A) I is bounded by a polynomial in I{A). Therefore, statement 
(ii) is really about the limit inferior as e tends to 0. 

Proof of Proposition 2.2. Assume that (ii) holds. For any positive 
number e, 



I{A)-Hog [ e--^(^)dx ^ /(^)-^log / 
J A J A 



^ i{Ar^ log (e-(i+^)^(^)|yi n r(i+.)/(A)l) 

= -1 - e + I{A)-^ log \A n r(i+,)j(^)| . 

Given the above remark, (i) holds. 

To prove that (ii) is necessary for (i), let us argue by contradiction. 
If (ii) does not hold, then there exists a positive (3 and a sequence of 
positive numbers converging to as A; tends to infinity, such that 

liminf /(^)-i log 1^ n r(i+,^)j(^)| < -13 

A — ^oo 

for any k large enough. For k large enough, < f3. Thus, using 
Proposition 2.1, 

e-^(^)da;^ j e'^^^^ dx + / e-^(^) dx 

^ e-'^^M n r(i+,^),(^)| + L((l + ek)I{A)) 
^ g-/(A)(l+/3) ^ ^^^-/(A)(l+.,) (1 + (1 + ek)I{A)Y 
^^IiA)il+e,+o{l)) as^^oo. 
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Therefore, 

limsup/(A)-^ log / e-^^"^) dx ^ -(1 + e^) < -1 
A—^oo J A 

and (i) does not hold. 

Notes 



The idea of the proof of Proposition 2.1, writing e"^ as the integral 
of e~" over [ /, oo) and then using Fubini's theorem is all but new; 
at most it has not been used enough. Licb and Loss (1997) call 
a much more general fact the "layer cake representation". I have 
the recollection of hearing talks using similar tricks and referring 
to the coarea formula. See Federer (1969) to know all about it, or 
Morgan (1988) to simply know what it is. Proposition 2.2 grew from 
Broniatowski and Barbe (200?) which we wrote, I think, in 1996 or 
1997. This second chapter has the flavor of large deviation theory and 
the notation / for the function in the exponential is not a coincidence. 
More will be said in the notes of chapter 5 and in chapter 7. There 
are by now a few books on large deviations. Dcmbo and Zcitouni 
(1993) and Dupuis and Ellis (1997) arc good starting points to the 
huge literature. From a different perspective, and restricted essentially 
to the univariate case, Jensen (1995) may be closer to what we are 
looking for here. 

Introducing the set Ac suggests that the variations of — log / arc 
important. It has to do with the following essential remark. The 
negative exponential function is the only one — up to an asymptotic 
equivalence — for which integrating on an interval of length of 
order 1 produces a relative variation of order 1 on the integral. 
Mathematically, the fact is that 

poo roo 

/ e'^ dx = / e"^ dx . 

Jt+s Jt 

Hence, in approximating e~^dx by e^^ dx, we are making a 
relative error equals to — 1. If s is fixed, this relative error stays 
fixed, event if t moves. If one writes the analogue formula with a 
power function, one obtains 

dx _ 1 _ / t Y dx 

Jt+s ax^ ~ (t + s)" ~ \t + s) Jt ax«+i ' 
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As t tends to infinity, the ratio + s))" — 1 tends to 0. Thus 
the relative variation of the integral tends to as i tends to infinity. 
Finally, in the subexponential case, for a > 1, 

/•oo roo 

/ ax^-^e--" dx = e-(*+^)"+*" / ax^-'e'^'' dx . 

Jt+s Jt 

The relative variation is now driven by 

exp(- 5^-1(1 + 0(1))) -1. 

If s is positive (resp. negative), it tends to —1 (resp. infinity) as t 
tends to infinity. 

In other words, the exponential scale that we use on the variations 
of / ensures that the variation of f{x) in this scale are of the same 
order of magnitude as the variations of x in space. 



3. The basic bounds 



In this chapter, our goal is to obtain lower and upper bounds for 
the integral J^e~^^^^dx. In order to get useful estimates, we need to 
decompose A into small pieces on which the integral can be almost 
calculated with a closed formula. These pieces are given by the 
geometry of the graph of the function /, and therefore we will first 
devote some time in introducing a few useful quantities related to this 
graph. 

3.1. The normal flow and the normal foliation. 

Recall that we assume 

/ is strictly convex on M'^, nonnegative, twice differentiable, 
with lima._^oo — +00. 

Up to adding a constant to 7, which amounts to multiplying the 
integral e~^^^Mx by a constant, and up to translating Ahy a fixed 
amount, we can assume that 

7(0) = 0. 

Let D7 denote the differential — or gradient — of 7 and D^7 its 
Hessian. The strict convexity assumption implies that D^7 is a 
symmetric definite positive matrix. 
Recall that we use 

Tc = {.X G M"' : /(.x) c} = r^{[0,c]) 

to denote the level set of I. We define the level lines — or level 
hypersurfaces — of 7 as the points on which 7 is constant, that is 

Ac = { X e M'^ : I{x) = c} = r\{c}). 

Since 7 is a convex function, Fc is a convex set. Moreover, 7 being 
defined on W^, we write 

c>0 
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Let X be a nonzero vector in M!^ — equivalently, I{x) is nonzero. Set 
c = I{x). The gradient D/(a;) is an outward normal to Ac at x and 
defines a vector field on M"^ \ { }. For t nonnegative, we write ip{x, t) 
the integral curve of the vector field D7, such that i^{x,0) = x and 
l['il}{x,t)) = I{x) + t. Sometime it will be convenient to use the 
notation i/jtix) or ipx{t) instead of ijj{x, t). 

DEFINITION. The flow tpt is called the normal flow at time t. 




X, the flow 4'{x,s) for s in [0, t], 
the level sets A/(^) and A/j^j+t hilighted. 

It is also convenient to introduce the outward normal unit vector 
to Ac at X, 



3.1.1. LEMMA. The function t ip{x,t) is a solution of the 
differential equation 

dt"^ ' |D/(^(x,t))|" 
with initial condition '4>{x, 0) = x. 
Proof. By definition of the normal flow, 

1 = t)) = (d/(v(x, t)) , ^^^Pix, t)) . 

The result follows since ■^ip{x, t) must be collinear to D/(^(a;, t)) . ■ 
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The next trivial fact is due to the convexity of /. It asserts that the 
norm of the gradient |D/| can only increase along the normal flow. 

3.1.2. LEMMA. The function t \D I (^il^{x,t))\ is nondecreasing. 
Proof. Since D^/ is positive definite, 

^|D/(V'(x,t))|' = 2(D2/(V;(x,t))7V(V'(a;,t)),7V(V(x,t))> ^0. . 

The monotonicity of |D/| along the normal flow yields a Lipschitz 
property of the map 1 1— il)(^x, t). 

3.1.3. COROLLARY. For any x in M'^ \ {0} and nonnegative real 
number t, 

\i;{x,t)-i^{x,0)\ = \'i/j{x,t)-x\^ * 



\BIix)\ ■ 



Proof. The result follows from the fundamental theorem of calculus, 
Lemmas 3.1.1 and 3.1.2, since 

.* ds 

ds 



\i^{x,t)-ij{x,0)\ ^ I ^^Hx,s) 











\Bl{i;{x,s))\ 



t 



\DI{x) 



As a consequence, two points on the same integral curve and nearby 
level lines of / cannot be too far apart in Euclidean distance. This is 
expressed in the following result 

3.1.4. COROLLARY. Let M be a positive number. If y is in the 
forward orbit of x under the flow ^'t; i-e., 

ye{Mx)-t^O}; 

and if I{y) — I{x) is m [0,M], then 

M 
\DI{x)\ 
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Proof. Since y = ilj{x,t) for some t, 

M ^ I{y) - I{x) = I{ij{x, t)) - I{ij{x, 0))=t. 
Hence, Corollary 3.1.3 implies 

t M 
\y -x\^ . ^ 



\DI{x)\ ^ |D/(x)| 

So far, we have just collected some trivial informations on the 
normal flow. Since / is twice differentiable, the level lines Ag are 
submanifolds — hyper surf aces — of R'*. The function ■^j maps Ac 
onto Ac+f For p in Ac, we can consider the tangent space TpAc of 
Ac- The differential of tpt at some point p, which we denote either by 
Biptip) or tlJt*{p), maps Tpk^^p) to T^^(p)Ai(p)+f We will need some 
estimates on this differential, and we first calculate its derivative with 
respect to t. 

3.1.5. LEMMA. The following holds 



d 



D2/ 



Proof. Lemma 3.1.1 yields 



by the chain rule. Therefore, 

This is the result since AT = D//|D/|. ■ 

Since / is convex, the level sets Fc are convex, and one can see by 
a drawing that ipt* must be a map which expands distances — see 
the picture before Lemma 3.1.1. This is expressed in the next result. 

Before stating it, notice that for p in Ac, the tangent space TpAc inherit 
of the Hilbert space structure of W^. Thus, it can be identified with 
its dual (TpAc)*. Since V't* maps T^Ac to T^^(p^Ac+t, its transpose 
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maps (T^^^(p)Ac+t)* = T^^(j,)Ac+t to TpAc. Consequently, V'SV't* is a 
linear map acting on TpAc. 

3.1.6. LEMMA. For any nonnegative t, det(V'J'i/'t*) ^ 1- 

Proof. Since ipo is the identity function, we have det('(/'o*) = 
det(Id) = 1. It is then enough to prove that for every h in TpAc, 
with c = I{p), the map 

is nondecreasing. 

Writing q = tptip)y we infer from Lemma 3.1.5 that 

\i^t*{p)hf = 2({ld-2N0N{q))^{q)^/Jt4p)h,^/Jt4p)h) • 



Since 'tpt*h belongs to TqAc+t it is orthogonal to N{q). Therefore, 

N{qf,Pt*{p)h = {N{q) , A*{p)h) = . 
Since D^/ is nonnegative, it follows that 

j^\^t*{p)h\^ = 2(^(g)V't*(p)/i,V't*/i) ^0. 

Consequently, the given map is indeed nondecreasing. ■ 

It follows from Lemma 3.1.6 that is invertible. Since t ipt 
a semigroup, ipt* cannot expand too fast as t increases. 



3.1.7. LEMMA. For any p in'R'^\{0} and any nonnegative s, t, 

\B^l{i;t+u{p)) 
\Bl{^t+u{p))f 



k+sAp)\\ ^ Ut*{p)\\ exp (^J^ ||D'^^^(V'i+4y))|| ^^ j _ 



Proof. Notice that Id — 2N (g) is an inversion. Its operator norm 
is 1. Using Lemma 3.1.5, writing q = ijjt{p) and using the triangle 
inequality for the increments of the function t ||V't*(p)||) we obtain 



d_ 
di' 



d 



D2/ 
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Integrate this differential inequality between t and s to obtain the 
result. ■ 

In the same spirit as in the previous lemma, a little more work 
gives us a good control on the growth of det(^^'0t*)) improving upon 
Lemma 3.1.6. 

3.1.8. LEMMA. For any nonzero p and any nonnegative t, 
d 



— logdet(V't*^t=,(p)) 



\^i{Mp))\' 



tr 



In particular, for any nonnegative s, 

det(V't+s,*V't+s,*(p)) 

^ det(V't*V't*(p)) exp 2 / / — .m2 
V Jo |D/(V't+„(p))| 

and the function s i— > 'i^^{'^J+s,*'^t+s,*{p)) nondecreasing. 

Proof. Let p be a nonzero vector. Consider a local chart p{-) : 
U C M"^"^ ^i{p) around p, such that p{0) = p and the vectors 
di = ^p(O) form an orthonormal basis of TpA/^p). Lemma 3.1.1 
yields 



d d"^ 



u=Q 



u=0 



Since ilJudj belongs to T^j(p)A7(p)+t, it is orthogonal to N(^pt{p)) for 
all j. Moreover, dA^ is self- adjoint when acting on the tangent space 
of the level curves A/^p-j^.j — a known fact when studying the second 
fundamental form of immersions of hypersurfaces in W^; see, e.g.. Do 
Carmo (1992) — we obtain 

^{tpt*{p)9i,tpt*dj) = ^^^^^^^^^^^ (^dN{i;tip))Mp)di,i^t*{p)dj^ ■ 



3.1. The normal flow and the normal foliation 



31 



Using that Ddet(x) = dct(.T)tr(x ^•) — a proof of this fact is in 
Lemma 10.0.1 — and that i/^t* is invertible, 



dt 



|D/(Vt*b))| 



\t>i{Mp))\ 



dN{Mp)) 



This is the first assertion in the Lemma, since the restriction of dA'^ 
to the tangent space coincides with D^//|D/|. Moreover, since D^/ is 
positive, we have 



tr 



Proj 



7V((p)Aj(p)+t 



dN{Mp)) 



tr[D^/(V>,(p))] 

\^i[Mp))\ 



and the second statement follows from integration. 

The third statement follows from the first one. 
nonnegativc, and so ^ logdet(^^'0j^(p)) ^ 0. 



Indeed, D^/ is 



We now have all the informations we need on the normal flow. 
Its integral curves define a foliation of M'^ \ { } . We will obtain 
an expression for e~^^^Ma; by integrating first on A/(^), and then 
along the leaves. The virtue of the normal flow is that computation 
of the Jacobian in the change of variables is easy, because the normal 
vector field and the level sets of the function / are orthogonal at every 
point of W^. Before rewriting the integral, we need to parameterize 
the boundary dA in the flow coordinate system. Speciflcally, for any 
point p in M'^, deflne 



rA(p)=inf{t^O:^t(p)G^}, 

with the convention that inf = +oo. In the sequel, we will always 
read e~°° as 0. 



32 



Chapter 3. The basic bounds 




^l/{x, ta{p)) 



p, the flow i/'(p, s) for s in [0, ta{p)], 
the level sets A/(4) and f^i(A)+TA(p) hilighted. 

It is convenient to agree on the following convention: Except if 
specified otherwise, we will equip submanifolds of with the inner 
product of M*^ on their tangent spaces. This defines their Riemannian 
measure completely — not only up to a multiplicative constant. 



NOTATION. Whenever a set M is a submanifold of \ 
■Mm for its Riemannian measure. 



we write 



We can now rewrite our integral. 

3.1.9. PROPOSITION. // I{A) is positive, the following equality 
holds, 



^^(-)da 



-1(A) 



-TA (P) 



s^O pl{^rA{p)+s{p))\ 

|det(V'J^(p)+,^>^^(p)+,_,(p)) |'/'dsd7WA,(^)(p) . 



/(A) 



Proof. Use Fubini's theorem to first obtain 



j^e ^(^)da; = j j lA(a;)I[7(^),oo)(c)e " dcdx 

= e-^(^)y" e-Vnr,(^)+,|dc. (3.1.1) 



This brings the leading exponential term out of the integral. 
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Let p be in A7(^) , and consider a local chart 
p(-) : U C R"-' ^ A,(^) 

around p. If x is in the image of p{U) through the normal flow, we 
can parameterize it as x = '4>(^p{ui, . . . , s) . Setting di = -^p, 

the Jacobian of the change of variable x {ui, . . . , Ud-i, s) is 



J 



= det 



det(^Vs*(p)5i, • • • , ips*{p)dd-i, ^V'b, s)) 

OS 

(det(V'J.V'.*))'^'det((a,,5,),^..^,_J^/\ 



1/2 



d 
ds 



since {ips*{p)9i , ^VXf'. •'^)) = 0, for i = 1, . . . , d - 1. 

In term of the new parameterization and gluing charts using a 
partition of unity, we obtain 

\AnTi^A)+c\ = J lAnr,(^)+,(a;)da; 

= / / U{Hp,s)) -^^p{p,s) X 

det(CV'.j'/'d7WA,(^)(p) ds. 

Notice that I^(?/;(p, s)) vanishes if s ^ ta{p)- Using (3.1.1) and 
Fubini's theorem, the expression of <iip{p, s)/ds in Lemma 3.1.1 gives 
the result. ■ 

Just as we introduced ta {p) , the first entrance time of a point in A 
through the flow, define 

Xa{p) = sup { s : i^rA(p)+sip) ^A} 
the last time of exit of A starting the clock time at ta{p), and 
Xa{p) = inf { s > : V'rA(p)+s(p) ^ A} , 

the first time of exit of A starting the time at ta {p) ■ If A has "holes" , 
Xa be strictly less than Xa- Both XAiP) ^^'^ XAiP) be 
infinite. 
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Proposition 3.1.9 and our lemmas on the normal flow yield the 
following basic bounds, which will turn out to be surprisingly sharp 
and useful. 

3.1.10. THEOREM. For any Borel set A ofR'^ with I{A) positive, 



/ 



-ta(j>)_ 



|D/(Vr,(p)(p))| 

f ( ||D^/|| \ 

'''^^ \ {'4^rA{p)+t{p)) - 1 j dt 

dsdA4A,(^)(p) 



and 



ta{p)<oo 



/ exp / ( 



|D/(V.,(p)(p))| 
■IID^/II \ 

J^^2-i^rA{p)+t{p)) - ^ Jdt 



(Be aware that the letter d in front of ||D^/||/|D/| in the exponential 
in the upper bound refers to the dimension d of W^, and not to some 
differentiation ! ) 

Proof. The upper bound follows from Proposition 3.1.9, and the 

following observations. Clearly, lA{'ipTA(p)+s{p)) is at most 1, and 
vanishes whenever s ^ XAiP)- Lemma 3.1.8 implies 

d^^{^rA{p)+s,*i'rA{p)+s,*) 

and Lemma 3.1.2 gives 

1 1 

|D/(^,^(,)+,(p))| ^ |D/(V;,^(,)(p))| • 



3.1. The normal flow and the normal foliation 



35 



To prove the lower bound, notice first that lA{'iprA{p)+sip)) — 1 
s in [0,Xa{p))- Using Lemma 3.1.8 and Proposition 3.1.9, 



det(^T(^)>^^(^)(p)) 

O I / y , \ I 



|D/(V',,(p)(p))| 



/ exp ( - S - log pl{'iljTA(p)+s{p)) I + log |D/(V'r^(p)(p)) I 

dsdMAj^A)iP)- 

Using Lemma 3.1.1, we have 

A log |d/(^(p, .)) I = (^^,n) {^Pip, s)) . 

Therefore, 

log |D/(V'^^(p)(p)) I - log \Tyi{iprA{p)+s{p)) I 

and this brings the lower estimate. ■ 

REMARK. The gap between the upper and the lower bounds comes 
essentially from the term ||D^/||/|D/p in the exponential. One should 
expect this ratio to be small for large arguments and in interesting 
situations. For instance, when d = 1 and I{x) = |a;|°, we have 

|D2/(:r)| a-1 1 



|D7(a;)|2 a ' 

In the same vein, if ta{p) is not too large, we should be able to replace 
tpTA(p)iP) by i^oip) = P, while if ta{p) is large, the term e~'^^^^ will 
make the contribution of those p^s negligible. Therefore, we hope to 
obtain the approximation 



-TAip) 



e 



\BIip)\ 



J^^ exp(^J -ldi)dsd7WA^(^)b) 
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which is a quite manageable expression. To prove that such approxi- 
mation is vahd requires some ideas on the order of magnitude of the 
final expression in (3.1.2), and the next section is devoted to the study 
of this term. 

3.2. Base manifolds and their orthogonal leaves. 

Following the remark concluding the previous subsection, we want to 
rewrite the integral as e~^*^"^^ times 

so that we can isolate the leading terms. We first need to recall some 
notation and introduce some definitions. 

For p in Ac, we denote by cxpp(-) the exponential map at p in the 
manifold Ag. That is, if u belongs to TpAc, the value of cxpp(n) is the 
point on Ac at a distance |u| to p on the geodesic starting at p in the 
direction u — see for instance Do Carmo (1992) or Chavel (1996). The 
exponential map is always defined for a small value of the argument 
in the tangent plane. 

If M is a submanifold of ^k^a) p is a point in Af, the tangent 
space rpAj(^) splits as TpM (TpM)-*- where we denote by (TpM)-*- 
the orthocomplement of TpM in TpAj(^Ay 

DEFINITION. The projection of a set A C on Aj(^) — through 
the flow ip — is the set of all points p in Aj(^) such that ipt{p) belongs 
to A for some positive t. A submanifold T>a C A/(^) is called a base 
manifold — for A — if the projection of A through ip is contained in 

Often, we will loosely speak of the projection of A on A/(^), 
forgetting to add that it is through the flow tp. 

In what follows, if Va is a base manifold, we denote by A; = k{VA) 
its dimension. In order to replace the integration over Aj(^) by an 
integration over a base manifold Va in (3.2.1), we need to attach 
orthogonal leaves to T>a- For this construction, we borrow some 
definitions and notations used to parameterize tubes as in Weyl's 
(1939) formula. 

Consider a base manifold T>a of A, and define a normal bundle by 

NpVA = TpAj^A) e TpVA , peVA. 
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Let dist(- , •) denote the Riemannian distance on Aj(^j^y For every u 
in NpT>Ai consider the radius of injectivity of A/(^) in the direction it, 

eA{p, u) = sup { t ^ : dist(expp(i«),p) = i } , 

the exponential map being that on Ai(^a) ^ before. Define 

^A,p = {{t,u) : u e NpVA , |it| = 1 , ^ t < EAip, u)] , peVA, 

so that [JpQX>A '^^A,p is the set of aU focal points of Va immersed in 
A/(^). From its definition, we infer that expp(f2^_p) coincide with the 
set 

'^A,p = {q & ^li^) • there exists a unique minimizing 

geodesic through q which meets Va orthogonally at p}. 

Set 

On UAi we can define a projection vr^ onto T>a as follows. Any 
point q in uja can be written in a unique way as g = expp(M) for 
{p-,u) G Va X NpT>A- We set nA{q) = P- In other words, q is 
on a unique geodesic starting from "Da and orthogonal to Va] the 
projection of q on Va is the starting point of this geodesic in Va- We 
call the sets {7r^^(p) : p € Va}, the orthogonal leaves to Va- By 
construction, u)A,p is in A.j(^a) = ^lip)^ ^iid it contains ir^^^p). 




In order to rewrite the integral (3.2.1) as an integral over a base 

manifold and its orthogonal leaves, we need to calculate the Jacobian 
of the change of variable q G ua expp(ti), p G Va, u £ NpVA- 
For this purpose, notice that the differential tta*{q) maps TqAj(^A) 
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onto T^(q)PA- Since tta is constant on the orthogonal leaves, the 
orthocomplement of ker 7ryi*((j') is a vector space of dimension k. Let 
bi{q), . . . , bk{q) be an orthonormal basis of this orthocomplement, and 
define the Jacobian 

JiTAiq) = det(^{nA*{q) A ... A TTA*{q)){bi, . ■ . 

— when A; = 0, read Jt:a{q) — 1- Fcdcrer's (1959) co-area formula — 
see the appendix of Howard (1993) for a simple proof in the smooth 
case we are using here — yields 



DI{q) 

{J7rA)-\q)dM^-.^^-^{q)dMvAp)du- (3-2.2) 

We can now express the Riemannian measure Al^-i^^^ over the leaf 

^^^{p) in normal coordinates. For this aim, following Chavel (1993, 

chapter 3), let Tp^t,v denote the parallel transport along the geodesic 
from p to expp(to), and furthermore, let Rq denote the Riemannian 
curvature tensor of the leaf vr^^ o T^Aiq) at the point q. Denote 
lp,v{t) = expp(tt;) a point of the geodesic starting from p in the 
direction v with constant velocity \v\. Then, consider the matrix 

ny{t) = T~^l^ o i?expj,(to) {ip,v{t)^ ■)ip,v{t) ° %,t,v 

defined on Tp'K']^{p). We define a matrix- valued function Ap{t,v) as 
solving the differential equation — in the set of matrices over the 
{d — k — 2)-dimensional vector space TpTTj^{p) QvR = NpVA QvR — 

^A{t,v) + n^(t)A{t,v) = Q , 



subject to the boundary condition 

) 

= Id. 

The boundary conditions imply 



^(0,t;)=0, J^Mt.v) 



dciA{t, v) dct(t Idffid-fc-2) r^t'^ ^ as t -> 
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uniformly in v in the unit sphere Sj, ^-i^p-^{0,l) of NpVA- The 
expression of the Riemannian measure over a leaf, Ai^-i^^y in normal 
coordinates yields 

f _^ f f f I[0,«](^A(cxpp(tt;))) 

Ju^o JpeVA Jves^^^-i^^^ Jtelo,eAip,v)] |D/( expp(to)) | 
1 



I 7 ^ .—jdct{Ait,v))dtdfipiv)dMvA{p)du, (3.2.3) 

where ftp is the Riemannian measure over the unit sphere of TpTT^^{p), 
centered at the origin. 

This last expression looks quite complicated. However, we are 
almost done, and the intuition goes as follows. Roughly, we want 
to choose T>A as ^i(a) ^ 9 A, so that / is minimal in A over Va — 
but we will actually need to have a little bit of freedom for some 
applications and make a slightly more subtle choice. Due to the term 
e~", let us concentrate on the range u = 0(1). The assumption 
|D/(j))| tends to infinity with \p\ will imply that in good situations 
rA(expp(if)) grows very fast as a function of t, since exp(tf) is 
transverse to Da where ta is minimal. Hence, if the indicator function 
of TA{ex.pp{tv)) ^ u = 0(1) is not zero, we must have t small. But 
for small f s and p G Va, 

{jTTAr\cWp{tv)) ^ {jTTArHp) = 1 

since expp{tv) « expp(O) = p and 7r^(p) = p; furthermore, as we 
mentioned earlier, for small i's 

det{A{t,v)) 

and again, since ex.pp{tv) fa p, 

\Dl{expp{tv))\ \BI{p)\. 

Thus, we should expect the right hand side of (3.2.3) to be approxi- 
mately 

f e-- I I 
Ju^o JpeVA \^^ip)\ Jves^ -I, Jo,i) Jte 



i(-oo 



,„] (TA{expp{Lv)) - ta{p)Y'''-^ dtdnpiv) dMvA^P) ■ (3-2.4) 
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Assuming that dA is smooth, for a unit vector v in Tpvr^ (p), the 
function t i— > Tyi(exp(tt;)) — ta{p), is minimum for t = 0. This 
function should be approximately quadratic near 0. The integration 
in t and v then gives the volume of an ellipsoid, which is related to the 
curvatures of ^i(a) and dA near p. We will also need to prove that in 
good situations (3.2.1) is equivalent to (3.2.2), in which we restricted 
the integration to cj^. 

At this stage it should be noticed that pinching the curvature R 
yields differential inequalities for A and therefore bounds for (3.2.3). 
We shall not pursue this line, but the reader may notice that there are 
situations where R is easy to calculate — for instance if I{x) = jzp 
as in the Gaussian case, then R = Id/|a;| — or to pinch. Thus, more 
precise estimates could be obtained, and even a control of the error 
terms. This could be useful in some applications, but it is not clear 
that it is worth investigating in a general setting. 

Notes 

This chapter builds upon the classical theory of STU'faces and integra- 
tion on manifolds. If you don't know any differential geometry, don't 
give up! It took me a long time to find a good starting point, that is a 
book that I could read and understand. I found it when I was visiting 
the Universite Laval at Quebec! Buy Do Carmo's (1976, 1992) two 
books, and start reading the one on curves and surfaces. If you are 
as bad learner as I am, do what I did, that is, all the exercises. Once 
you read about two dimensional surfaces and understand that curva- 
ture is a geometric name for a second order Taylor formula, you will 
have enough intuition to digest the abstract Riemannian manifolds — 
which really copy the classical theory of surfaces in M'^ . After reading 
Do Carmo's books, I found Chavel (1996) and some parts of Spivak 
(1970) most valuable. Some colleagues liked Morgan's (1992) book 
very much as a starting point, others McCleary's (1994). 

The change of variable comes from that in Weyl's (1939) tube 
formula. Weyl's paper is very nice to read, and a good part does 
not require much knowledge of differential geometry. 

Notice that the formula derived in Proposition 3.1.9 does not rely 
upon convexity of /. It could be of some use in other places. 

It is certainly possible to extend the estimates obtained in this 
chapters, the previous ones and the next ones, to some integral over 
noncompact manifolds. In this case, one should replace the Lebesgue 
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measure by the Riemannian one. I do not know if this could be of any 
use. 



4. Analyzing the leading 
term for some smooth sets 



In this chapter we analyze further the quantities involved in the 
integral (3.2.3). We will assume that 

dA and ^i(a) are smooth — i.e., — submanifolds of R^, (4.0.1) 

and furthermore that 

the base manifold Da is a smooth — i.e., — / , „ 

(4.0.2) 

submanifold of R , possibly with boundary. 

Notice that so far wc have a lot of freedom to choose the base manifold, 
so that (4.0.2) is not much of a restriction. Borrowing from the theory 
of large deviations, it is convenient to introduce the following notion. 

DEFINITION. A base manifold Va for A is called a dominating 
mMnifold (for A) if, for allp in Va and all unit vectors v in Tpn^^{p), 
the function 

te {- eA{p,-v),eA{p,v)) TA{expp{tv)) e [0,oo) 

has a local finite minimum att = 0. A point in a dominating manifold 
is called a dominating point. 

Thus, if Da is a dominating manifold, the set dA is pulling away 
from A/(p-) along the lift through the normal flow of the geodesies 
orthogonal to T>a- Notice the important fact that if a point is in a 
dominating manifold T>a, it may not be in OA, i.e., ta{p) may still be 
positive. This is an essential difference from the large deviation theory, 
or analogously to what would be considered in Laplace's method. This 
distinction will turn out to be crucial for some applications — see 
sections 8.2, 8.3 or 10.2. The downside of allowing this extra freedom 
is that the main result has a slightly more involved statement. But it 
is worth the extra power provided. Another feature is that only the 
points for which TA{eyipp{tv)) is finite matter. 
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4.1. Quadratic approximation of near a dominating mani- 
fold. 

Under the assumptions (4.0.1)-(4.0.2), HVa is a dominating manifold 

and p is one of its points, the function t ^ ta[ expp{tv)^ is minimal at 
for all unit vectors v tangent to tt^^{p) at p. It admits a quadratic 
approximation. To obtain it, let us denote by nA,,,^ (resp. IidA,q) 
the second fundamental form of Ac (resp. dA) at g G Ac (resp. 
q G OA). Those are defined using the unit outward normal vector 
N to Ac. At points of A/(^) n dA, this unit normal N is also a unit 
normal for dA, and so dA is oriented by an extension of A^. The 
submanifold 7r^^(p) C A/(^) C admits a second fundamental form 
also associated with the normal field A''. For q belonging to Tr^^{p), 
we denote it by IIT „. It is nothing but the restriction of Ha,, ., „ 

to TqTT2^{p). The leaf 7r^^(p) can be lifted to OA through the normal 
flow in considering the set 

ilJA{TrJ^{p)) = {'<lj{q,TA{q)) , q e TTj'^ip) } . 




On this picture, the line leaving the point q is the normal flow (//(q, t) for 
t ^ 0. It crosses the boundary dA at the circled point ip{q, T/\(q)). When 
q moves on the geodesic 7r4^(p), the crossing point describes i/'A(7r~^(p)). 

This lifted leaf admits a second fundamental form relative to the 
extension of A^ on dA. At any point p in Dj^ for which ta{p) 
vanishes, this second fundamental form is the restriction of IldA,p 
to the tangent space of the lifted manifold, which coincides with the 
tangent space Tp'K'2^{p) = NpVA- We denote by Hg^^ this restriction. 
The difference of the fundamental forms, nj^^^^^j p — ^qa p ^e 
interpreted as follows. Assume we live on the leaf 7r^^(p) C Aj(^a)j 
and look at dA along the "vertical" direction given by the normal 
flow. From tt^^ (p) , the boundary dA pulls away as we move away from 
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p £ TT^ {p)r\dA. The difference of the fundamental forms is a measure 
of the curvature of dA viewed from Tr^^{p), with vertical distance 
measured in Euclidean distance along the normal flow. However, as far 
as the integration goes, the right measure of vertical distance is on the 
increments of the function /. At an infinitesimal level, the increment 
is l/|D/(p)| near p — sec Lemma 3.1.1. Thus, in the geometry of the 
level sets of /, the bending of dA away from ^i(^a) is measured by 

Equipped with this interpretation, the following result is very 
natural. 

4.1.1. PROPOSITION. Let p{s) be a curve on Tr^^{p), such that 
p{0) = p belongs to dAnAj^^A)- Under (4.0.1)-(4.0.2), 

rA{p{s)) = y |D/(p)K(nX^^^^,, - ng^,^y(0) ,;,'(0)> + o{s') 
as s tends to zero. 

Proof. We will make use of the following elementary fact. If q{s) is 
a curve on a surface M and z/ is a unit normal vector field on M, then 
= 0. Differentiating with respect to s, 

{Mq)q',c^) + {v{qU")=Q. 

Since p belongs to 9>ln A^(^), the function q G Aj(^) TA{q) £ M 
is minimal at p. Hence, T)ta{p) is collinear to Y)I{p). Differentiating 
at s = the relation TA{q) — s = TA^'ipiq, s)) yields Dta{p) = — D/(p). 

Next, let p{.s) be a curve on A/^y^) such that p{0) = p. Let t = p'{0) 
be its tangent vector at p, and set v = p"{0). Define 

ei = -{BI{p),v) + {B\A{p)t,t). 

A Taylor expansion yields ta{p{s)) = s'^6\/2 + o(s^). To prove 
Proposition 4.1.1, we just need to find an expression for 6i. Denote 

g{s)=,l^[p{s),TA{p{s))]=^{p{s),sHi/2 + o{s'')) , 

the curve p{s) lifted to dA through the normal flow. A Taylor expan- 
sion and Lemma 3.1.1 gives 

g[s)=p + St+-(v + ei—-{p)'j+0{s''). 
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In particTilar, the curves iP[p{s),ta{p{s))~\ and p{s) have the same 
tangent vector t at ,s = 0. Writing for a unit normal vector field 
extending N(p) on dA, using the Weingarten map, and the elementary 
fact at the beginning of this proof, 

{naA,pt,t) = {du{p)g'{0),t) = -(1.(0) , /(O)) 

This gives an expression for 6i involving 

{N{p),v) = {N{p),^p{s)l^^) = -{dN{p)t,t) = {llA^,^„pt,t) 
— using the elementary fact again — and ultimately the result. ■ 
4.2. Approximation for det A{t,v). 

In this section we obtain some bounds on the determinant of the 
matrix A{t,v) involved in formula (3.2.3). They will be instrumental 
in approximating (3.2.3) further. These bounds are given by the 
inequalities of Gunther (1960) and Bishop (1977) — see, e.g., Chavel 
(1993, pp. 118-121). They involve the curvature tensor of the surface 
7r^^(]?) C A/^p) and so we will first compute that of the level set Ag. 

In order to avoid any ambiguity, recall that if .E is a subspace of 
W^, the compression of a d x d matrix M to is the linear operator 
from E into E obtained in restricting M to E and then projecting the 
image of M into E. Thus, writing Proj^ for the projection from 
to E and \e for the restriction to E, the compression of M to is 
ProjEoMle. 

4.2.1. LEMMA. Let S{p) be the compression of I)'^I{p)/\l)I{p)\ to 
TpAji^p-^ . The curvature tensor of the surface A/(p) at p is given by 

R:x,y,ze TpAj(^p) ^ R{x, y)z 

= {S{p)x,y)S{p)y - {S{p)y,z)S{p)x G TpAi(^p) . 



Proof. Let Projj^^^^^^ denote the orthogonal projection from R*^ 
onto TpAj^py Furthermore, we write V for the standard Riemannian 
connection on W^. For any x in TpA^^-^, 

-^mT,K,,,,y.N{p) = ^—Proj^^^ V.D7(p) = S{p)x. 



4.2. Approximation for det,4(t, v) 



47 



It follows that the second fundamental form of C M is the 

bilinear map 

B:x,ye TpAi^) ^ B{x, y) = {S{p)x, y)N{jp) e M*^ G Tpki^^ . 

The result follows from the immersion of ki(p) in the space of null 
curvature, and say, Theorem 2.2 in Chavel (1993). ■ 

We now obtain an expression for the sectional and the Ricci 
curvature of ki(p) • 

4.2.2. LEMMA The sectional curvature of the surface Ai{p) at p is 

K ■.x,ye TpAj(p) K{x, y) 

^ {S{p)x,x){S{p)y,y) - {S{p)x,y)^ ^ ^ 
|x|2|y|2 - (a;,y)2 

Its Ricci curvature is 

Rice : x,y e TpAj(^) i— >■ Ricc(a;, y) 
= {{S{p)x,y){S{p)ej,ej) - {S{p)x,ej){S{p)ej,y)) e R, 

where ei, . . . , ea-i is any orthogonal basis ofTpkj^y 

Proof. The expression of the sectional curvature follows from The- 
orem 2.2 in Chavel (1993) say, and the calculation of the Weingarten 
map Projj-^^^^^j V^^A^ in the proof of Lemma 4.2.1. The expression for 
the Ricci curvature follows from Lemma 4.2.1. ■ 

We can now pinch — i.e., bound above and bellow — the cur- 
vatures, and for this pTuposc, let us denote Amin(D^-^(p)) and 
Amax(D^-^(p)) the smallest and the largest eigenvalues of the sym- 
metric definite positive d x d matrix D^I{p). 

4.2.3. PROPOSITION. For any x in TpAj^), 

Ricc(x,x) ^ ('^-2)^^^|^^yp^kr 
If x,y are in TpAj^^) and are orthogonal — i.e., {x,y) = — then 
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Proof. Since D^/ is a symmetric definite positive matrix, so is S{p). 
Let ^ si ^ • • • ^ Sd-i be the eigenvalues and ei, . . . ,ed-i be 
the corresponding orthonormal basis of eigenvectors of S{p). Let us 

denote by z = Xli^i^d-i ^i^i ^^'^ V = Y^i^^d-iVi^i vectors in 
Tp Aj(p) . We then have 

Ricc(a;, a;) = ^ ^ ^ xjsiSj — x'jsfj 

= XI ^hi(^^^{S{p)) - 

If Sd-1 ^ tr(S'(p))/2, the function s G [si,Sd„i] ^ .s(trS{p) — s) is 
increasing on [si,trS'(p)/2], decreasing on [tTS{p)/2, s^^i]. 

Otherwise, if s^-i ^ tiS{p)/2, this function is increasing. Thus, in 
any case, for any i = 1, . . . , d — 1, 

Si(ti:S{p) - Si) ^ si[trS{p) - si) A Sd-i{tiS{p) - Sd-i) 
>{d- 2)sl . 

Since S{p) is a compression of D^/(p)/|D/(p)|, the smallest eigenvalue 
of D^/(p)/|D/(p)| is larger than si. The bound on the Ricci curvature 
follows. 

To obtain an upper bound for the sectional curvature, notice that 
if X and y are of unit norm and orthogonal, then 

K{x,y)= X Sixj X Siyj - ^ Xiyisi^ ^ Sd-iSd-i 

^A,„ax(D2/(p))7|D/(p)|2, 

where the last inequality comes from the fact that S{p) is a compres- 
sion of D2/(p)/D/(p). ■ 

Let us denote by ui]^-if^^-^{p) the radius of injectivity of p in the 
manifold Tx^{p). Since ■nj^{p) is built of geodesies of Aj(p), we have 

where c(p, u) is the Riemannian distance from p to its cut point on 
^i[p) along the geodesic leaving p in the direction u — notice that this 
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property on inj^-i^p^(p) is very specific to the way we constructed the 
leaves, and to the fact that we only consider the point p. 

From the previous lemmas, we can deduce some bounds on 
det^(t,i'), using some classical volume comparison theorems. 

4.2.4. PROPOSITION. (i) ( Bishop- Gunther) Assume that for any 
unit vector v in TpTr^^(p), the Ricci curvature along the geodesic 
expp{tv) in tt^^{p) is nonnegative. Then, for allt in [ 0) inj^-i(p-)(p)) , 
the inequality det^(t, v) ^ td-k-2 f^^i^g 
(a) For any positive t, let 

^max(p, t) = sup I • « ^ '^A'iP) ' d{p, q) ^ 



Then, for any positive to and any t in 
deiA{t,v) ^ 



\/ Krnax{p,to) 
Sin(^Krnax(p,to)i^'"'"''"^ 



V-f^maxb, to)) 



Proof. (i) is Bishop's (1977) or Giinther's (1960) theorem. Assertion 
(ii) follows from Bishop's (1977) theorem — see, e.g., Chavel, 1996, 
pp. 118-120 — provided we have the proper upper bound on the Ricci 
curvature along the geodesic expp(tf) in ■nj^iji). For x,y in Tg7r^^(p), 
denote K^-i^^ ^{x,y) the sectional curvature at q of the manifold 

■k'^{p). Then (ii) follows from the bound 

^^-/{j>uy^^y'^^^i(v)^iy^^y)^ l|D/(g)i|2 ■ 

To prove this inequality, extend j/ to a vector field in the tangent 
bundle T'K^{p). The second fundamental form of the immersion 
7r^^(p) C Aj(p) is given by B{x,y) = Vio]^^^-Lf^^^{Vxy) where V 
is the connection on A/(p). In particular, if x is parallel — which is 
the case for x = expp(t^;) — then i?(a:, x) = 0. Hence — see, e.g., 
Theorem 2.2 in Chavel, 1996 — if x and y are of unit norm. 



We conclude in using Proposition 4.2.3. 
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4.3. What should the result be? 

Combining the results of sections 4.1 and 4.2, the heuristic argument 
at the end of section 3 and (3.1.2), we see that we should expect 



\^Va |DAp)M.G5^^^_1(^^(0,1) 



4.d-k-2 



te[0,eA(p,v)] 



l[o,u]{l\^I(p)\{GA{p)tv,tv)y^ 

dt diJ,p{v) dM-pA {p) dit , 



where Gyi(p) = ^Ai(a),p ~ ^dA,y Assuming for the time being that 
everything works as expected, we can complete the calculation of the 
asymptotic equivalent. Recall that the volume of the unit ball of M" 



isa;„ = WVr(f + 1). 



4.3.1. PROPOSITION. For any a, b both positive, the following holds, 

p-^A {[>) 



lim / e 



-au 



I I 



I[o,H [l\BIip)\{GA {p)tv, tv) ) t"-^-^ dt diip{v) dMvA iP) 

Q-TAip) 



I 



jj^dMvAip)- 



peVA |D/(p)|(<^-'=+i)/2(detGAb)) 



REMARK. In Proposition 4.3.1, the integral in t is in the range 
[0, oo), while it is in the range [0,eA{p,v)) in the integral at the 
beginning of this section. But this will not make any difference 
ultimately. 

Proof. Notice that t'^~''~'^dtdfip{v) is the Lebesgue measure in the 
tangent space Tp7r^^{p) = R"^"^"^. Therefore, 

/ / l^o,bu](h^Iip)\{GA{p)tv,tv)y-''-^dtdfip{v) 



4.3. What should the result be? 
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\mp)\ 



2bu \(d-k-i)/2 ojd- 



\BIip)\ 



k-l 



{det Ga{p)) 



1/2 



Hence, the integral for which we want to take the hmit as uq tends to 
infinity is 



Jue[o,uo] J pi 



uo] JpeVA |D/(p)| 



(d-fc+l)/2 



(26) 



(d-fc-l)/2 



^(d-k-i)/2 dMvAp) 



[det Ga{p)) 



1/2 



We perform the integration in u and let uq tends to infinity to obtain 
the term T(^{d — k + l)/2) in the Proposition. ■ 



5. The asymptotic formula 



We now have everything that we need to derive our asymptotic 
approximation for fj^e~^^^^dx as the set A tends to infinity nicely. 
The assumptions that wc require may look quite bad at first glance. 
However, the reader will sec in the next chapters that they are in fact 
quite well tailored for applications. 

We will assume that for any fixed positive M, there exists some 
positive number ca,m, depending on A and meeting the following 
requirements. 

We first assume that there exists a manifold Va such that 

Va is a dominating manifold for the set AriTj(^A)+CA m ' (5 1) 
of fixed dimension k. 

Hypothesis, (5.1) contains two key requirements. First, k docs not 
depend on A, restricting the class of sets A that we consider. Second, if 
Va is a dominating manifold, it has to be a base manifold. Therefore, 
dA can pull away from Va only in the orthogonal directions. This is 
a restriction for instance if Va is a closed curve with two boundary 
points; we may want to allow dA to pull in the outward tangent 
directions at the boundary points. However, by breaking A into 
smaller pieces, the restriction can be overcome in practice. 

Let us denote by Aj^ the projection of ^ fl ^i[a)+ca m 
through the normal fiow tp, that is 

Am = {p^ ^I{A) ■ TAip) ^ CA,M } ■ 
Under (5.1), it makes sense to assume that 

Am^ [j ^A,p , (5.2) 
peVA 

where uja,p is defined in section 3.2. 

When dA is smooth. Proposition 4.1.1 gives a quadratic approx- 
imation of TA- Of course, such an approximation is local and does 
not have any kind of uniformity with respect to A. Moreover, looking 
at (3.2.4), we also would like to have a quadratic approximation for 
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ta{q) — ta{p) for q near ^(p, r^(p)). How near? Well, we need to 
cover Aj^. But Proposition 4.1.1 suggests that 



D/ 



should be a good approximation for TA^q) — ta{p)- Since ca,m 
is expected to be small compare to I{p) = I {A), we should have 
i^{p,TA{p)) quite close to p, and so maybe the quadratic approxima- 
tion given in Proposition 4.1.1 is just fine. This is what happens in 
many interesting examples. But for the time being, there is no other 
way than to force it, and assume that there exists a linear map Ga{p) 
on TpAj^^A) such that 



lim sup 



TA{q) - TA{TrA{q)) 



l\Dl{7rA{p))\{GA{7rA{q))exp, 



^ (g),exp ^ 



- 1 



= 0. (5.3) 



This can be rewritten as 



lim sup 

TA{eWpi\v\))^CA,t 



TA{expp{v)) -ta{p) 



l\BIip)\{GA{p)v,v) 



- 1 



Now that we have a dominating manifold and a quadratic approxima- 
tion, it makes sense to proceed as the heuristic argument at the end 
of section 3 suggests. Then, we can guess the asymptotic equivalent 
of the integral in using the result of section 4.3. Define 



Cam = i^M ^ • -^(^(^) + c) ^ j^e 



-I{A) 



d-k+l 



Va |D/|^^(detGA): 



Since all our asymptotic analysis is driven by the desire to have the 
integral influenced mainly by the behavior of dA near Va, and to 
reduce the integral to Ar\Tj(^A:)+cA m where things go well, we assume 
that 

CA,M ^ c\m ■ (5-4) 

Any ca,m larger than will do, and the reader will easily see that 

the larger cam more stringent our assumptions are. Hence, 

M ^^^t choice, but can seldom be calculated exactly. In 

practice, picking for ca,m an asymptotic equivalent of a multiple of 
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M ^^^^ ideas, a typical order of magnitude of ca,m 

is log 7(^4) for the applications that we will study. 

Though we want to be able to localize the study of the integral 
to points of A near Va, we still want A to have some thickness! 
In particular, we do not want the main contribution in the integral 
to come from the thinness of yl — think for instance of taking 
A = Tc+e \ Tc for e = exp(— e"^) or even smaller, and looking for 
asymptotics as c tends to infinity. This can be ruled out by assuming 
that the first exit time of the normal flow after a time ta is large 
enough, namely, that for all positive M, 

lim inf x;^(ji) = +oo. (5.5) 
A—*oo peA^ 

Since Xa ^^^^ equal to Xa' assumption (5.5) implies 
hm inf XAip) = ■ 

A—*oopeAj^ 

In the same spirit, we see that it does not make any difference in 
the asymptotic analysis if we replace ^ by ^4 n T'i(^a)+ca m- Thus, if 
CA,M stays bounded, the set A is quite small in the geometry of the 
level sets of /. In this case, we would need to deal with an analogue 
of Proposition 4.3.1, but keeping mq fixed. This would introduce an 
incomplete gamma function. In order to simplify the result, we assume 
that ca,m is chosen such that 

lim CAM = +00 . (5.6) 

A—*oo ' 

This assumption is satisfied in all the applications that follow; but the 
reader will see that up to changing some constant in the asymptotics, 
the proof still goes through without it. 

Our next condition can be explained by first thinking of two difficult 
situations. Imagine that around the dominating manifold T>a, the set 
OA is pulling away very slowly from A/(^). Thus, going in the normal 
direction to "Da on Aj^a)i need to take q very far away from "Da 
in order to have TA{q) not too close to 0. In such circumstances, we 
should need an extra rescaling so that in the new scale ta grows faster. 

Another difficult situation would be to have the level sets Fc 
concentrated along a proper subspace of — think for instance, 
if d = 2, of some ellipsoid with one axis growing like and the 
other like c. On a large scale, the problem would be essentially 
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lower dimensional. Notice however that in such circumstances, the 
curvature of the level set should be of different orders of magnitude at 
different points. Near points of high curvature, the normal flow could 
pull away very slowly in the Euclidean geometry. And so there is not 
much hope to localize the problem near the boundary of A. 

A way to take care of these different situations is to relate the 
curvature of the level sets with the rate at which dA pulls away from 
Va in the normal directions. Define 

io,Mb) = supjt : inf ^^TA{expp{tv)) ^ ca,m^ ■ 

Whenever q in Tr^^{p) is at distance to,M{p) or more from p, it satisfies 
ta{(i) ^ ca,m- We then assume that for all positive M, 

lim sup J Krna^ip, to^uip)) *o,m(p) = . (5.7) 



peVA 



In order to be able to use Proposition 4.2.4.i, we impose that for 
any p in Va and any unit vector v in TpTr'^^{p), 

the Ricci curvature along the geodesic expp{tv) in q\ 
"""a^W '^^M is nonnegative. 

— I am inclined to believe that convexity of / is enough to guarantee 
(5.8) but could not prove it. 

It is also possible that the set A and the function / are such 
that |D/| varies widely in a small neighborhood of some point in the 
dominating manifold. In such situation, / would increase very fast 
in some specific directions, and much slower in others. Then, the 
rescahng needed, even in T^A^ip) — when 7r2^{p) is of dimension at 
least 2 — could not be homogeneous in different directions. This can 
be ruled out by assuming that for any positive M, 

Jim^sup{||^^-l| ■■peVA,qeAj^nnj\p)} =0. (5.9) 

In order to proceed along the lines of the final remark in section 3.1, 
we need that 



A 



, \\D^lUt{p))\\ . 

hm sup <^ sup f -P^ Am^ ta{p) < oo > =0. (5.10) 

t>o |D/(V't(p))| 
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This assumption mainly controls the growth of /. A sufficient condi- 
tion to guarantee (5.10) is of course to have ||D^/(p)||/|D/(p)| tends to 
as \p\ tends to infinity, which means essentially that / grows slower 
than any exponential function. 

Actually, we also need a rate of convergence in (5.10), but in a 
rather weak sense, namely that 

Jini^sup|y^ ll_|(^„(p))du:;pG^^| =0. (5.11) 

Finally, we need two technical assumptions in order to carry over 
the intuitive argument at the end of section 3.2, namely that for any 
positive M, 

lim sup |j7rA(g) - l| = 0, (5.12) 



and that for any fixed positive w, and, as A moves to infinity, 

g— TA 

= o( [ ^n^^ ^dMvA- (5-13) 

Often ta{-) vanishes on Va- Then (5.13) is satisfied whenever (5.6) 

is. In particular, this is always the case when "Da is a single point or 
the union of a finite number of points. Similarly, (5.11) and (5.12) 
always hold if Va reduces to a point. 

The reader may legitimately be suspicious about these assumptions, 
and how they can be checked in applications. We will show in 
nontrivial examples that they are not too difficult to verify. They 
rediicc the problem of approximating the integral to much more 
manageable small problems, which can be handled by systematic 
methods. The key point to understand is perhaps that as ca,m is 
usually much smaller than I {A) , the set Aj^^ is actually quite close to 
Va in the scale given by /. 

We can now state our main result. 

5.1. THEOREM. Assume that dAnTj(^A)+CA m ^'^ ^ smooth — twice 
differ entiahle — manifold for every positive M. Then, under (5.1)- 
(5.13), 



L 



e-A^)d.~e-^(^)(2.)^ / --^ ^dA^^, 

Jva |D/|^^(detGA)2 
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as A moves to infinity, and with k = dim 2?^. 

REMARK. If "Da is an open subset of A/(^), then k = d—1, and the 
formula should be read with the determinant of Ga to be 1. 

Proof. Wc obtain an upper and a lower bound for the integral. 

Let us start with the upper bound. Let e be a positive number. 
Assume first that, for M large enough, 

A = Anrj^A)+c^M (5.14) 

provided I{A) is large enough. Thus, for p in ^i[A)i either ta{p) is 
less than ca,m or is infinite. 

Recall that ipo* is the identity since i/jq is the identity as well. 
Consequently, assumption (5.11), Lemmas 3.1.6 and 3.1.8 imply that 

for taIp) ^ CA,M, 

1 ^ det(V^^^(^),,V.,(p),J^/' 

^ exp J "|D7p" (^"*^^^)'^"J ^ 1 + e 

provided I {A) is large enough. It then follows from (5.5), (5.9), (5.11), 
Lemma 3.1.2 and Theorem 3.1.10 that 

as I{A) tends to infinity. Under (5.2), the set { < oo } is included 
in LOA- Consequently, (3.2.3) yields, as A tends to infinity, 

|D/(P)| Jy, 



pgVa |D/(p)| Jves^ -1, ,(0,1) 

P A 



L 



I[o,n](TA(expp(to)) - TA{p)j \DI{p)\ 
\&[o,eA{j>,v)) \J'^A{fiWp{tv))\ |D/(expp(to))| 

det^(i, v) dtdnp{v) dMvAip) (l + o(l)) . 

Combine (5.8) and Proposition 4.2.4.i to upper dctA{t, v) from above. 
Use (5.9) to get an upper bound \DI{p)\/\Dl{expp{tv))\ by 1 + e, and 
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(5.12) bound | J7rA(expp(to)) | by 1+e. Then, assumption (5.3) yields, 
for I{A) large enough. 



I I 



\mp)\ . 



d-k-2 



dt dnp{v) dMvA ip) (1 + e) 

Extend the integration in t over the domain [0, +oo) and use Propo- 
sition 4.3.1 to conclude 



/^e-(^)d. ^ e-(^)2^r(^:±±l)a;,_,_ 



1 X 



r(l + e) ^ • 



peP4 1D/|'' 2^"^ (detG^)2 



Since e is arbitrary, this yields the proper upper bound since 
'd-k-V 



d-k-l _ 

r 



i^d-k-i = (27r)" 



Before we can drop assumption (5.14), we need to prove the 
lower bound. To do so, apply Theorem 3.1.10, Lemma 3.1.6 with 
assumptions (5.5) and (5.10) to obtain, as A moves to infinity. 



Notice that 

|D/(^.,(p)(p))| 



-ta(p) 



1 r^^'i'^ d 



dAlA_(p)(l + o(l)) 



-2 



(5.15) 



|D/(p)| 



exp 



ds 



log|D7(V>(p,s))|'ds 



1 /•^^(P) (D^/. iV,iV) 



exp y- / 2 



^ exp 



D7 



|D/|2 

2 (V'.b))d5) • 



(V'(p, s))dsj 



This last inequality and assumption (5.11) imply 
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ace 



Consequently, up to a multiplicative factor of (l+o(l)) , we can repl 
D/(V'ta{p)(p)) D/(p) in (5.15). Wc then use equality (3.2.3) and 
proceed as follows. First, we change the variable u into ta{p) + w, 
and restrict the integration to w between and wq for some positive 
Wq. Second, we restrict further the domain by integrating only over 
the points p in Va with ta{p) less than ca^m — uiq. On this range, 
wc can use assumptions (5.9), (5.7), Proposition 4.2.4-ii, assumptions 
(5.12) and (5.3) to obtain, for I {A) tending to infinity. 



Ja JwelO; 



wo\ 



ta{p)^ca,m-wo 



pe^A |D/(p)| 



/ / l[0,«.(l-e)] (^1 D/(p) I {GA{p)tv, tv)) 

dtd/ipiv) dMvAp) d^«(l + o(l))~^ . 

Arguing as in the proof of Proposition 4.3.1, we obtain that for I{A) 
large enough, 



l%)^CA,M-wo |D/(p)|' (detG^(j3)) 
Since e is arbitrary, assumption (5.13) gives then 

/ e-^(^)d:c 
J A 

^ e-^(^)2^ / ^ dMvAip) . 

JpeVA \BI{p)\^^{detGAip))^ 

It remains for us to drop the assumption that A = ACi Tj(^a)+ca m 
the upper bound. This is immediate; for a general set A, write 

/ e-'^(^)da:= / e'^^^^dx + / e-^^^Mx. 

For the first integral in the right hand side of the above equality, the 
theorem — proved in this case! — gives the asymptotic equivalent. 
For the second one, it is less than the integral over F^^^^^^^ ^, that 
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is L(^I{A) + ca,m)- Assumption (5.4) shows that it has a negligible 
contribution to the asymptotics. 

When k = d—1 and Va is an open subset of A/(^), the asymptotic 
equivalent in Theorem 5.1 is nothing but (3.2.1). ■ 

5.2. REMARK. Assumption (5.8) turns to be difficult to check in 
practice. The main reasons are that geodesies can seldom be explicitly 
calculated, and that the curvature tensor may be difficult to calculate. 
However, we only used it to apply Proposition 4.2.4.i when deriving 
the upper bound in the proof of Theorem 5.1. It would be enough to 
have 

det^(t, v) 



lim sup 



fd-k-2 



- 1 



■■ V e S'r^^-i(p)(0, 1), 



[Q,eA{p,v)) , expp{tv) £ Aj^^ =0. (5.16) 

This condition will turn out to be easier to check in many cases. This 
could also replace (5.7) as well. 

In a similar spirit, (5.3) may be tedious to verify. Often ta cannot 
be easily calculated, but is only known via an asymptotic expansion 
as A moves to infinity. Dtic to the error term in the asymptotic 
expansion, the uniformity in (5.3), for small q very close to p, may 
be difficult to check. Therefore, we will make a rather systematic 
use of the following weaker hypothesis. Assume that there exists a 
function ta defined on Aj^ such that 

lim sup \TA{q) - fA{q)\ = 



and 

TA{q) - TA(vr(g)) 



lim sup 

A-^^q^Aj, 



1 







l\Dl{'KA{q))\{GA{T^A{q)) exp^](^)(g) , exp^j^^^ ) 

(5.17) 

for some G{p) on TpA/(^). Then, Theorem 5.1 holds when (5.3) is 
replaced by (5.17). Indeed, let e be a positive number. In the proof 
of Theorem 5.1, we now use the bound 

l[0,„(l_e)-e] (lj\^I{p)\{GA{p)tv, tv 

^ I[o,«] (rA{expp{tv)) - TAip)^ 



^ I[0,M(l+e)+e] 



^-\DI{p)\{G{p)tv,tv)) ■ 
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By the dominated convergence theorem, 



e-+0 



u>0 



hm / e-"(^x + (l±e)±e)" ^ 'du= / e'^^u'^^du, 



and one easily sees that the proof of Theorem 5.1 goes through. 

It is sometimes convenient to weaken even a tiny bit this assump- 
tion, only assuming that T^(g) — Tj^i^nAiq)) can be approximated by 
some ta(^) — 'fyi(7rA((?)) in the sense that 



lim sup 



TAiq) - TAii^Aiq)) - {rAiq) - fAii^Aiq) 



and of course keeping requirement (5.17). 

Let us now explain how Theorem 5.1 can be used to obtain 
information on limiting conditional distributions. 

Assume that we consider a log-concave density function propor- 
tional to on W^. As A moves away to infinity, there is not much 
hope for the conditional distribution 

/ e-^(^)da; / / er^^'^'^dx 
JAnB ' J A 

to converge to a nontrivial limit. Indeed, a fixed bounded set B 
does not intersect A if I{A) is large enough. So, we need to rescale 
B. For this, consider a normalizing function A ^ \a ^ (0, oo). 
We will require that the dominating manifold Va/^a converges in a 
weak sense. But for the time being, consider the rescaled conditional 
distribution 

IJLa{B)= I e-^(^)da: / / e-^^^^dx. 
JahXaB ' J A 

It converges weakly* if for any continuous and bounded function / on 
W^, the integral 

j fdfiA = Jj{x/\A)e-'^''^dx / J^e-'(-Ux 

converges as A moves to infinity. Its limit is a linear form in /, 
associated to a measure, the weak* limit of fiA- 
Let us assume that 



as A moves to infinity, the family of rescaled measures 
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, d-k+i , , ~Z~T^ TTT / / ^ IT-. n '^~''+^ 



\m{\Aq)r^{d^lGA{\Aq)Y I J ^ |D/| — (det G^)^ 
converges weakly* to a probability measure u. (5.18) 

Assume furthermore that 



lini ^ sup { |D/(p)|-i : p e 4m } = (5.19) 



and 



lim sup {\q-p\l\A : g € 7r^^(p) , ta{v) ^ c^.m } = . (5.20) 
We then have the following convergence. 

5.3. COROLLARY. Under the assumptions of Theorem 5.1 and 
(5.15)-(5.20), the conditional distribution jia converges weakly* to 
V as /(>!) tends to infinity. 

Proof. Argue as in the proof of Proposition 3.1.9 to obtain 
^^''^f{x/XA)dx = J J lA{x)lin^)^^){c)e-''f{x/XA)dcdx 



e 

A 



^V^(f,,s)|det(Vj.V.*)'^V(V'(?^,s)/AA) 
d-^A/(A)(p) dc. 

Consider a function /, bounded and continuous. After adding a 
constant to /, we may assume that / is larger than some positive 
number. Then, up to introducing a term f (^jj{p, s) / Xa) , the bounds 
in Theorem 3.1.10 remain vahd. 

Under (5.14) and arguing as in the proof of Theorem 5.1, as A tends 
to infinity, we have 



e-^(^)/(x/AA)da;~e-^(^) 



^ 

|D/(p)| 

ta(p)<°o 



e 7(V'TA{p)+sb)/AA)dsdXA,(^)(p) 
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where XAiP) ^ Xa{p) ^ XaW- Assumption (5.14), (5.19) and 
Corollary 3.1.4 imply 

1 I, / N 1^ Xa{p) . CA,M ... rKn^\ 

J^\i^r.ip)UP)-p\ ^ J^^^ ^ J^^^ = o{l) (5.21) 

as I{A) tends to infinity, uniformly in p G ^liA) with ta{p) < oo. 

Assume that / is uniformly continuous. Since / is bounded and 
larger than some positive number, (5.21) imphes 

hm sup i — ;y -1 : peAj^,0^s^ ca,m } = 0. 

A-*oo L I J{p/Xa) J 

Thus, using (5.5), 

T^(p)<00 

We make a change of variable as we did in (3.2.2). Noting that (5.20) 
implies 

hm sup<^ 1 : veTpTT^ (p) , 

TA ( expp(u)) ^ CA,M , p G I = , 

we obtain 



/ e-^(^)/(a:/AA)dx~e-^(^)(27r)^ X 
J A 



J-^A \m{p)\^{detGAip))^ 



Make a change of variable q = p/\a and use (5.18) to obtain 

J fdu, (5.22) 



^.^ X4e-^('^)/(.x/AA)dx 



for all uniformly continuous, positive functions /. 

If / is bounded, we drop the restriction (5.14), as we did in the 
proof of Theorem 5.1. Using Theorem 5.1, we can consider arbitrary 
bounded uniformly continuous function / in (5.22). This implies — 
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see, e.g., Pollard (1984) — that the conditional distribution converges 
weakly*. ■ 

Notes 

There are many things related to Theorem 5.1 that I wanted to do 
but could not. 

A first one is to understand to what extent an exponentially 
integrable density may be approximated by a log-concave one at 
infinity. Here is the beginning of what could be a proof. Let / be 
a density on R'*, such that the moment generating function 



is finite in a neighborhood of the origin. Under some classical steepness 
conditions — see, e.g., Barndorff-Nielsen (1978) or Brown (1986) — 
the differential m = is a diffeomorphism. Denote by m™"^ its 
inverse. We now follow word for word the construction of Barbe and 
Broniatowski (200?), but in a different setting. 

The function log (j) is convex. Let I be its convex conjugate, that is 

I{x) = sup{ log 4>{t) - {t, x) } . 

Using the change of variable a = s—x and Fubini's theorem, we obtain 

j y"e<™'°'(^'-^)'^)lA(s-a;)/(s)dsda; = j e-^^^^da. 

This allows us to define a new density 

Jg(mi'-(6-x-),x-)j^(^, _ x)f{s)ds 



9a{x) = 



/^e-^Wda 



The interesting fact is that 5^(0) = J^f{s)ds^ J^e ^^"'Ma. Consider 

the rescaled density r'^gj\{rx) with r possibly depending on A. If we 
can prove that r'^gA{r •) converges to a limit, say g{-), as A moves to 
infinity, in such a way that pointwise convergence at holds, then 



/ f{s)ds ~ r-'^g{0) [ e'^^^-'^da. 
Ja Ja 



Thus, when integrating over A, we can approximate the density / by 
a multiple of e~^^^\ and then use Theorem 5.1. 
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To achieve this approximation, we can calculate the Fourier trans- 
form of r'^gA{r •)• It is 

g,^A{\)=r'' j e-<^'^><7A(rx)dx = j e-'^^"'^/'') d/iA(a) , 

with 

fc(a,A) = i(a,A) +log0(m"^^(a)) - log>(m"'^(a) + ?A) , 

and 

iXA{B)= j e-f(«)da/ j e-^('^)da. 
JAnB ' Ja 

In particular, as A moves to infinity, the support of /jLa — that is A 
— moves to infinity. A Taylor expansion of k{a,X/r) near A/r = 
gives 

, , ^ , , 1 /(D2log</))omi^"(a)^ x 

k{a, A/r) = -(^^ ^A, A^ + o{X^) . 

If this can be done as a tends to infinity — which is in the spirit of 
what we did using Proposition 4.1.1 to prove Theorem 5.1 — we can 
hope to approximate 

. ... /• /l/ (D^log(/>)om'"^(a) ^ ^\^^ 

Let V{a) = (D^log(6) o m"^^(a) be the so-called variance function of 
/. Inverting the Fourier transform of the approximation, we should 
obtain 



' '^'^ " J (2.)^/Met(y(a)/r)V2 ^^-^(») " 

If we can find r depending on A such that the right hand side has a 
limit as A tends to infinity, and use Corollary 5.3, we arc done. 

Unfortunately, I could not come up with useful conditions for this 
idea to work. 

When d = 1, it is possible to prove that if 5^ converges, then its 
limit is given by a mixture of either normal densities — as we outlined 
here — or gamma ones; this follows from Balkema, Kliippelberg and 
Resnick (1999). In higher dimensions, a related approximation is in 
Barndorff- Nielsen and Kliippelberg (1999). I somewhat believe that 
the whole virtue of saddlepoint approximations used in statistics is to 
provide some form of log-concave approximation in the spirit of what 
is outlined here. But most of the time, in the multivariate setting, it 
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relies on assumptions similar to the convergence of qa, which I don't 
find too appealing. 1 have been searching unsuccessfully for a decent 
condition on / itself. 

A second project, which perhaps would be desirable to carry out, 
is to obtain higher order expansions. Theorem 5.1 provides a one 
term asymptotic expansion. Starting from the equality in Proposition 
3.1.9, one could do the change of variable using the dominating 
manifold and the orthogonal leaves; then one would use asymptotic 
expansions for whatever function is involved, and obtain the desired 
approximation. Higher order differential geometry is involved. The 
difficulty is to come up with a set of usable conditions to perform all 
the approximations. Another route would be to mimic the practice 
of Edgeworth expansions in statistics. There are essentially two 
types of them: those that arc proved rigorously, and a vast majority 
that are called "formal" . To do the formal ones, the argument is 
pretty much to neglect what one believes to be negligible under some 
quite unknown conditions, and proceed. One could obtain formal 
asymptotic expansion in the same way. This may be of some value in 
a few applications. Indeed, sometimes one may not look for a theorem 
but maybe more for a guideline. 

A third path to explore would be to derive explicit upper bounds, 
starting either from Proposition 3.1.9 or Theorem 3.1.10. In particu- 
lar, I wonder if the technique developed here could be of any use to 
investigate "asymptotic" isopcrimctric problems. 

As pointed out in the notes to chapter 3, Proposition 3.1.9 does 
not use the convexity of /. This proposition is true for any smooth 
function / for which the sets Ac are smooth hyper surf aces. There may 
be some examples where the normal flow can be calculated explicitly 
and other arguments used in order to derive an estimate similar to 
that of Theorem 5.1. 

To conclude these notes. Corollary 5.3 is inspired by the Gibbs 
conditioning principle in large deviations. In the large deviation 
context, the reader may consult Csiszar (1984) and Bolthausen (1993) 



6. Asymptotics for sets 
translated towards infinity 



In this section, we study integrals of the form J^_^^e~^^^'^dx as \t\ 
tends to infinity. To avoid any ambiguity, recall that if ^ is a set and 
t is a vector, both in W^, the translation of A by t is 

A + t = {x + t : X e A}. 

We will assume that 

C M*^ is a closed bounded convex neighborhood 
of the origin, with smooth boundary and positive (6-1) 
curvature. 

The only restriction here is convexity — and smoothness, but we want 
to be able to use differential geometric methods! It could be dropped 
at the cost of a more sophisticated discussion on how A + t and A/(^^^) 
intersect. Up to changing t by a fixed amount, we can always assume 
that A contains the origin. 

We will control the growth of / at infinity, assuming that 

,i^!2lM = o and lim ll°'5,'^^"=0. (6.2) 

p-^oo |D/(p)| p-^oo \DI{p)\ ^ ' 

The second condition forces / not to increase too fast. Indeed, for 

d = 1, it reads \I" {p) / 1' {p)\ tends to as \p\ tends to infinity. Hence, 
for any small positive e and any q large enough, 

I'{p) fP I"{t) 



I'iq) 



exp J jT^dt ^ exp {e\p - q\) . 



For instance, the function I{p) = exp(|p|°) satisfies {I" /I'){p) tends 
to as p tends to infinity if and only if a < 1. So, roughly, / should 
have a subexponential growth. A polynomial growth, like I{p) = 
with a positive a, is admissible. 

The first condition forces / to increase fast enough. Indeed, when 
d = 1, it implies that for any positive e and x < y large enough, 

I{y) - I{x) = r I'm ^ - r logimt ^{y- a;)i^^ . 

Jx ^ Jx ^ 
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Hence, ultimately, / has to grow faster that any linear function. For 
instance, the function I{t) — t(logt)" satisfies (log/(i))//'(t) tends 
to as t tends to infinity if and only if a > 1. 

We will need to strengthen the second condition in (6.2) by assum- 
ing 

^hm log /(A + i) sup { : /(g) ^ /(^ + i) } = . (6.3) 

For a strictly convex function / the level sets Y c arc strictly convex. 
There is a unique point x in M*^ at which / is minimal. Assumption 
(6.1) implies that for any t with \t\ > 2diam(74) + |a;|, 

{A + t)nAi^A+t) = {pt} 

for a unique point pt- In particular, pt — t is in dA. 

6.1. THEOREM. /// is convex and (6.1)-(6.3) hold, then 

[ e-^Wd:r W*) as t ^ oo , 

JA+t \BI{pt)\ — Kt 

where Kt is the Gauss- Kronecker curvature of dA at pt — t. 

Proof. Write At = A + t. In order to apply Theorem 5.1, we need 

to have a candidate for the dominating manifold T>Af Clearly {pt} 
should do, and we set — {pt}- Since TA+t{pt) = by definition 
of Pt, the result of Theorem 5.1 reads 

-mA e-A-^+^)(27r)^ 
e ^ >dx^ — J- ast-^oo. 

\DI{pt)\'^{deiGA+tiPt)y' 

Now, recall that we should expect GA+t{pt) to be the difference of the 
second fundamental forms of h-n^A+t) and A + t aXpt — not restricted 
to anything here, since Vai is a point and so T^^{pt) is A/(^_|_(), 
up to what is in the cut locus of pt. However, the second part of 
assumption (6.2) asserts that asymptotically, the second fundamental 
form of Ai(^A+t) degenerates, and so, locally, An^A+t) is almost flat. 
Thus, GA+t{Pt) should be the second fundamental form of d{A-\-t) at 
Pt, which is equal to that of dA at pt — t. Its determinant is exactly 
Kt. This explains how to guess the result. It is hoped that this twelve 
line argument convinces the reader that Theorem 5.1 can be useful. 
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Now that the result is guessed, let us find a candidate for CA+t,M- 
Define 

c{t) = d\ogIiA + t) + ^ log \DI{pt)\ 

— in this formula, d refers to the dimension of W^. Since I{pt) tends 
to infinity with t, the first part of (6.2) implies limt^oo iD-^lPt)! = oo. 
Proposition 2.1 yields 



L{l{A + t) + c{t)) = ———^o{l) ast-^oc. 
|D/(pt)| 2 



Given (5.4) and our twelve line argiiment, c^+j^m = c{t) is a good 
candidate, no matter what A4 is. It guarantees (5.4) as well as (5.6). 
We now check all the assumptions of Theorem 5.1. 

As noted in chapter 5, since VAt = {pt} is a point, (5.1) is trivial. 
Notice that A^ j^ is included in the projection of A + t on Aj(^A+t)- 
So it is enough to check (5.2) with A^ replaced by the projection 
oi A + t. Assumption (6.2) asserts that the second fundamental form 
of Aj(^A+t) tends to uniformly over this surface. Thus, its curvature 
tensor vanishes asymptotically and the radius of injectivity of any 
point in Aj(^A+t) tends to infinity uniformly over the surface — this 
follows from Ranch's (1951) theorem or Klingenberg's (1959) lemma; 
see, e.g.. Do Carmo (1992) or Chavel (1996). As A + t stays of finite 
diameter, (5.2) follows. 

To prove (5.3) and find GA+t, we need to have some more informa- 
tion on ^j^jvf ^'^el on the normal fiow. The idea is that ^4^^^ should be 
very close to DAt — {Pt}- To prove this fact, we first define a family 
of local parameterizations of dA. For p belonging to OA, we denote 
by z^(-) the inward unit normal vector to dA at p. By compactness of 
A, there exists a positive ei, independent of such that dAr]B{p, ei) 
can be parametrized as all points of the form p + u + fp{u)v{p) for u 
in TpdA and some nonnegative function fp. Notice that the curvature 
assumption (6.1) ensures that there exists a positive matrix Qp such 
that fp{u) = ^{QpU,u)(l + o(l)) as \u\ tends to 0. Moreover, since 
dA is smooth and compact, the term o(l) is uniform when p varies in 
dA, and the matrices Qp arc bounded bellow by a fixed positive one. 

To prove that A^ shrinks around pt, notice that ly^pt — t) and 
DI{pt) are collinear since d{A + t) and Aj(^A+t) a.re tangent at pt- 
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Convexity of / implies 

l{pt + U + fpt-t{u)v{pt - t)) - I{pt) 



-l(^pt + s{u + fp,_t{u)v{pt - t))^ 



- V .r. ...... - ,, J 

= U^-Au)\m{p>t)\. 

Consequently, the points q in dA + t such that Kcj) ^ /(A + 
i) + CA-^t,M can be parametrized as + it + fpt{u)iy{pt — t) with 



^ 0(c(t)/|D7(pt)|) = 0(1). They can also be written as 



1 



Pt + u + -{Qp^-tu,u)v{pt - + 0(1)) 



(6.4) 



where the o(l) is uniform in t and \u\^ ^ 0(c{t) /\T>I{pt)\^ . 

Since the curvature of the level set of / tends to 0, the normal flow 
should be almost like straight lines on sizeable intervals. In order to 
make this statement rigorous, and seeking a linear approximation of 
the normal flow with good error bounds, an elementary calculation 
shows that 



D( 



D/ 



1 



ID/I VID/ 



2N®N 



|D/| 



1 



In particular, this implies the inequality 



D 



\\DI\^) 



1 IID2/I 



|D/| |D7| 

Notice also that assumption (6.2) insures that 



(6.5) 



^^^^ = '"^p { '■ ^^'^^ ^ ^^^'^} = ^^^^ * 



00 . 



|D/(g)| 

Using Lemma 3.1.1 and (6.5), it follows that with q = expj,^('u). 



i^iq,s) -q- s 



BI 



(9) 



rs pr 

Jo 

v{t) 



d D/ 
di^|D7|2 

.2 



(^'(9, i'))df dr 



|D/(,)| 2 ■ <^-« 

Next, let us prove that |D/(g)| ~ |D/(pt)| as t tends to infinity 
provided u stays bounded. Writing 7(5) = exp^^ (su/|n|), 

-log|D/(7(s))|ds 

"Ml(D2/(7(s))7'(s),D/(7(s))) 



log|D/(g)|-log|D/(pt 



iD/(7(.))r 



■ ds . 
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from which we obtain the bound 

|log|D%)|-log|D/(;pj)|| ^y7?(t) = yo(l) ast^o<6.7) 

Furthermore, we have a good control on the oscihations of D//|D/p 
in using (6.5); namely, for t large enough and say \u\ ^ 1, 

D/ _ V)I , A /"I"! d D/ 



|D/| 



D/ I /"I"! d D/ 



2\u\ 



Consequently, for |u| ^ 1, the inequality (6.6) gives the bound 



D/ 



27?(t) 



|D/(pt)| 



This is the linear approximation of the normal flow that we were 
looking for. Considering s = TA+t{(l) and using the linear approxima- 
tion for the exponential map in Proposition A. 2.1 — remember that 
q = expp^ (n) — we then obtain 

t/j{q,TA+t{q)) =Pt + u + rA+t(g)|^^| 

+ +^A+t(«)l"l)0(l) +^(OI«l'0(l) 

where the 0(l)-terms are uniform in \u\ ^ 1 as \t\ tends to infinity. 
Since ijj[q, TA+t{q)) is in the boundary of ^4 + 1 by the very definition 
of TA+tio), and since u belongs to Tp^Aj^p^^, (6.4) forces us to have 

= ^{Qpt-tu,u)u{pt - t){l + o{l)) 

as t tends to infinity, and uniformly in \u\ ^ 1. Therefore, provided 
ri{t)TA+t{Q) = o(l), we obtain 

TA+tiq) = ffi^(Qp,-t^,n)(l +0(1)) 

uniformly in \u\ ^ 1, as t tends to infinity. Since rj{t)TA+t{q) ^ 
rj{t)c{t), assumption (6.3) ensures that rj{t)TA+t{q) = ^(1), and we 
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proved that (5.3) holds with GA+t being the second fundamental form 

of dA at pt — t. 

Given our checking of (5.3), (5.5) is obvious since shrinks 
around pt — see the proof that |np = o(l) before equation (6.4) — 
and (5.5) holds. 

Assumption (5.7) is trivially satisfied. The shrinking of A^ jy^ to 
{pt} and assumption (6.2) — which implies that the curvature tends 
to 0; see also Proposition 4.2.3 — imply that to,M(p) = o{l) and 
Kniax{p,to,M{p)) = o{l) Uniformly over A^^m ^ tends to infinity. 

Assumption (5.8) is satisfied since / is convex, VA+t is a point, and 
the first part of Proposition 4.2.3 holds. 

Assumption (5.9) follows from (6.7) and the shrinking of A^^j^ 
around pt- 

Clearly, (6.2) imphes (5.10). 

Assumption (5.11) is implied by (6.3) and Lemma 3.1.2, while (5.12) 
holds systematically for A; = 0. 

Since TA+t vanishes on VA+t, (5.13) holds as well, and this con- 
cludes the proof of Theorem 6.1. ■ 

We obtained the conclusion of Theorem 6.1 by a brute application 
of Theorem 5.1. A little extra work makes the asymptotic formula 
nicer, replacing the term |D/(j)j)| by |D/(t)|. 



6.2. COROLLARY. Under the assumptions of Theorem 6.1 
jA+t |D/(t)|('^+i)/2K, 



e-A-)dx ^ . e-^(^+^) ast^oo. 



Proof. Since 



A log |D/(t + su)? = ^ {^'Iit + su)u^mit + su)) 
ds ^' ^ ^' \m{t + su)\^ 

the inequality 

I log \mit + u)\^- log mt)? 1^2/' ''^^''/Z^'ti'inJl M 

7o \i->l{t + SU)\ \L)l{t + SU)\ 

holds. Compactness of A, convexity of / and (6.2) imply that the right 
hand side of the above inequality is o(l), uniformly in u belonging to 
A a&t tends to infinity. Thus, |D/(t + it)| ~ |D-/^(i)| uniformly over u 



6. Asymptotics for sets translated towards infinity 



75 



in A as f. tends to infinity; and we can replace |D7(pt)| by |D/(i)| in 
the statement of Theorem 6.1. ■ 

In general, we do not have I{A+t)—I{t) = o(l) as t tends to infinity. 
This is easily seen when d = 1 and I{x) = for instance. Thus 
we cannot replace I{A + t) by I(t) in Theorem 6.1 or Corollary 6.2. 
In some instances, it is possible to obtain an asymptotic expansion 
for I {A + t). We illustrate this fact in the important case when I 
is a-positively homogeneous. To state the result, recall the notation 

= D//|D/| for the outward unit normal vector field to the level lines 
of /, and set 11 — D^//|D/|. The compression of 11 to the tangent 
space of a level line Ac is its second fundamental form. 

6.3. PROPOSITION. Assume that I is a-posittvely homogeneous 
and smooth. Assume also that A is a neighborhood of the origin with 

a smooth boundary. Let e be a unit vector in W^. Define r by the 
condition —rN{e) G dA. Then, as A tends to infinity, /(Ae + A) 
admits an asymptotic expansion over the powers A"~*, i G N, and 

I{A + Ae) = A"7(e) - A'*-V|D7(e)| + -^r'^{IiN{e) , N{e)) 
^ r'^T)'^I{e){N{e),N{e),N{e)) 



6 



r^{Ii{e)Q--l^^,^m^N{e) , Ar(e)) + 0(A"-^) . 



Proof. Since A is compact and / is smooth and a-positively 
homogeneous, we have, uniformly in u belonging to A and as A tends 
to infinity, 

7(Ae + u) = A°7(e + u/\) 

= A"7(e)+ ^DV(e)(n,...,w)+0(A"-'=-i). 

^^^^'^ i times 

The expansion of 7(Ae + 74) follows by induction. The computation of 
the first terms can be done by introducing the point u\ = u{\, e) in 
A, such that /(e + A/X) = I{e + u\/\). Since A is convex, compact, 
and I is convex, u\ in dA for A large enough. 

Taylor's expansion gives, uniformly in u belonging to A, 

+ = ^(') + ^(~(«) .») + ^(n(e)»,») 

+ .^V,^I{c){n,u,n) + 0{\-^) 
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as A tends to infinity. Consequently, u\ = —rN(e) + ui^x/X where 
ui^x = 0(1) as A tends to infinity — because ux has to minimize 
/(e+u/A), and so should minize {N{e),u) as well, up to a term of order 
A~^. Since e + ux/X belongs to dA, using the notation of the proof 
of Theorem 6.1, there exists a vector v = Vx T_j.N{e)9A = N{e)^ 
such that 



A A ^ '"^"^VA. 

= J + ^{Q-rN{e)V,v)N{e)+0{X- 



3^ 



as A tends to infinity. It follows that 
ux\ J., r|D/(e)| 



/(e + ^) = /(e) - + ^|D7(e)|(n(e)iV(e) , N{e)) 

\DI(e)\ r 
+ ^-^{Q-rNi^e)V ,v) - ^|D/(e) | (n(e)i; , 7V(e)) 

'%^^I{e){N{e),Nie),N{e))+0{\-^). 



6A3 

used that Q is symmetric — and its minimum value is 



The term in l/A^ is smallest when v = rQ _^^,^VL{e)'^ N{e) — we 

ill ■ 



'^^^'lr2(n(e)Q:i^,,,n(e)^iV(e) , N{e)) 

^3 

6A2 



D3/(e)(iV(e),iV(e),iV(e)). 



This completes the proof. ■ 

In particular, for a = 2, we can replace I{\e+A) in the exponential 
term of the asymptotic equivalent by 

A2/(e) - Ar|D/(e)| + ^(n(e)7V(e) , N{e)) . 

In the Gaussian setting, I{x) = — |xp/2. Thus, \DI{e)\ = 1 and 
n(c) = Id. The exponential term simplifies to 

A2 ^ r2 
— ^ - Ar + — . 
2 2 

A neat expression, but very specific to the Gaussian distribution. . . 
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Similarly to what we did in Corollary 5.2, we can obtain a result 
on conditional distribution. It is easy to prove that if X is a random 
variable with density e~^, then, the conditional distribution of X/\t\ 
given X ^ A + t can be approximated by a point mass at under 
the assumptions of Theorem 6.1. However, Theorem 6.1 itself leads 
to a more precise result. 

6.4. COROLLARY. Let X he a random variable with density propor- 
tional to . Let e be a unit vector in M"^. Under the assumptions of 
Theorem, 6.1. the conditional distribution of X — Xe given X £ A + Xe 
converges weakly* to a point mass at —rN{e) G dA as A tends to 
infinity. 

Proof. Let U he a neighborhood of —rN{e). We can find a closed 
convex set B inU with smooth boundary and positive curvature, such 
that dB and dA coincide in a neighborhood of —rN{e). Applying 
Theorem 6.1 twice, we see that 



Consequently, the conditional distribution of X — Ae given X & A + Xe 
is asymptotically concentrated on _B C U. Since U is an arbitrary 



A slightly more involved proof would show that the conditional 
distribution oi X — t given X £ A + t can be approximated by a point 
mass Skt pt — t as \t\ tends to infinity under the assumption of Theorem 



This chapter has three motivations. First it provides a simple example 
of applying of Theorem 5.1, and I hope it is of pedagogical interest. 
Second, translating a set away from the origin may be one of the most 
intuitive and natural ways to make it moving to infinity. Third, and 
this is more important, the Gaussian case has received some attention, 
due to statistical applications. In LeCam's theory of local asymptotic 
normality — see e.g., LeCam (1986) and LeCam and Yang (1990) 
— the asymptotic power of a test is given by the probability that a 




small neighborhood of —rN{e), the result follows. 



6.1. 



Notes 
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noncentered Gaussian vector lies in a given domain. Thus, one issue 
is to calculate the probability that a centered Gaussian vector hits a 
translated set, typically an ellipsoid. The work of Breitung (1994), 
Breitung and Hohenbichler (1989), Breitung and Richter (1996) are 
most relevant here. The remarks following Proposition 6.3 somewhat 
enlighten the Gaussian case. 



7. Homothetic sets, 
homogeneous I 
and Laplace's method 



In this chapter we consider a set Ai such that 

there exists a neighborhood of not intersecting Ai (7.1) 

Equivalently, we could say that the complement of Ai is a neighbor- 
hood of the origin. This assumption ensures that the sets At = tAi 
are moving to infinity as t tends to infinity. Assume furthermore that 

/ is a strictly convex, a-positively homogeneous function, (7.2) 

that is I{tx) = t°'I{x) for all nonnegative t, all x in W^, and some 
positive a. Under such assumptions, a must be strictly larger than 1 
to ensure strict convexity. Setting x = ty, we see that 

J At J Ai J Ai 

The asymptotic decay of the last term in the equality is related to 

the Laplace method. When I has a unique minimum in Ai, not 
on the boundary of Ai, this type of integral has been well studied. 
However, here, I{Ai) is achieved on the boundary of Ai, eventually 
on a A;-dimensional submanifold of M*^. A direct proof of an asymptotic 
equivalent of the right hand side, working out a multivariate Laplace 
method, is quite tractable. However, for purely pedagogical reasons, 
we will obtain an asymptotic equivalent of the right hand side of the 
above equality by using Theorem 5.1. This proof does not require 
more work than a direct one. The equality with the right hand side 
makes it easy to understand how Theorem 5.1 works. It also shows 
that Theorem 5.1 can be thought as a generalization of Laplace's 
method. 
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Consider T>a^ = AiD Aj^j^^-^ and assume that 

dAi, Vai and Ac are smooth — twice continuously , . 
differentiable — manifolds. 

Let k be the dimension of Va^ ■ We assume that Ai separates from 
Ai{Ai) with contact of order 1 exactly, and therefore, 

det Gai {p) + for any j? G • (7.4) 

We also need to make sure that A\ is not a thin {d — l)-dimcnsional 
layer against t^Ax ■ For instance we could assume that it is equal to the 
closure of its interior. Such an assumption is global. We can work with 
a much weaker local one. Roughly speaking, for p in D^i ■, we need to 
be able to squeeze a ball in the intersection of A\ with the forward 
image of a leaf Ti'^ij)) through the normal flow. This guarantees some 
thickness near Dax along the section of A\ orthogonal to Da^- The 
exact assumption is that 

there exists a positive e such that for all j) in Dax and 

any unit vector v in TpTr^^^, any s,t] in [0,e] the set (7.5) 

A\ contains i) [ expp(7yt)) , taj ( expp(77v)) + s ] 

The following is then a consequence of Theorem 5.1 and is a 
multivariate Laplace type approximation. 

7.1. THEOREM. Under (1.1}-{1.5), and ifVA^ is a base manifold 
for A\, then 



I At 

where 



J At 



ci = (27r)^ / 
Jt 



d-k-i /' d7VJx)Ai 



Va, |D/|^(detGAj^ 



As in Theorem 5.1, the asymptotic equivalent in Theorem 7.1 must 
be read with detG^i = 1 if T>Ai is an open subset of Ai(Ai) and 
k = d-l. 

Before proving Theorem 7.1, notice first that for a = 2, the 
polynomial term in t in the approximation has exponent A; — 1; it 
docs not depend on the dimension d of the ambient space. More 
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importantly, no matter what d is, this exponent can be written as 
+ it is an increasing function of A;, as one should expect. 

The proof of Proposition 4.1.1 shows that Gai{p) = — 
^SAi p difference of the fundamental forms of J^i{Ax) a'lid dAi 

compressed to the direction orthogonal to Va^ ■ 

During the proof of Theorem 7.1, we will make use of the following 
result, relating the large scale analysis of tA^ to that of Ai as far as 
the normal flow is concerned. 

7.2. LEMMA /// is a-homogeneous, then 

(i) ip{p,s) — tr^ijj{tp,t°^s), and 

(ii) TtAiitp) = t"TA^{p). 

Proof. To prove (i), write tp{p,s) = t~^'il;{tp,t"s). Since D/ is 
(a — 1) -homogeneous, Lemma 3.1.1 yields 

Thus, ^ obeys the differential equation of Lemma 3.1.1. It equals ^ 
since ip{p, 0) = p = ip{p, 0). 

Assertion (ii) follows since tai{p) is the first positive time s such 
that tip{p,s) is in tAi, and 

tp [tp, eTA^ (p)) = ti/j {p, TAi (p)) 

thanks to assertion (i). ■ 

Proof of Theorem 7.1. Since all the assumptions used in Theorem 

5.1 depend on CAt^M, we first need to guess its vahic, and then proceed. 
To this aim, we first need an estimate on the integral itself. We obtain 
it in evaluating the asymptotic equivalent given by Theorem 5.1. 
It is natural to consider 

Va, = Wa, = t { X e Ml : I{x) = I{Ai) } . 

Using the homogeneity of /, 

"Diitp) = e-^m{p) . 

Let p be in Vai , or equivalently, tp be in T>At ■ To estimate Gai {tp) , 
the equality (^t, Aj(^^)) = {tAi,tAi(^Ai)) gives 

GA,{tp)=t-'GAAp) 
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— the curvature tensors are rescaled by think of the sphere of 
radius t whose curvature is 1/t times that of a sphere of radius 1. We 
also have 

Consequently, 

Jva^ |D7|^— (detGAj2 

where ci is given in the statement of Theorem 7.1. Notice again that 
in a very few lines, Theorem 5.1 allows us to guess the result. 

As pointed out in chapter 5, the larger CAt,M is, the stronger the 
assumptions are. However, it is important to remember that all that 
we need is to find CAt,M larger than j^. So, consider a positive e 
and let 

c{t) = (da-k + {a- 2)^^-^ + f + ^) ^^^^ • 
From Proposition 2.1, we infer that 

L{l{At) + c{t)) = L{t^I{A,)+c{t)) = o(e*"^(^i)t*^-("-2)^-§) 

(7.6) 

as t tends to infinity. Thus, c^^ ^ is less than c{t) for t large enough 
and any positive M. We can try to choose CAt,M to be c{t). In this 
case, we just proved that (5.4) is satisfied. 

It should be noticed that our choice of c{t) is very naive. We 
inverted asymptotically the function L and evaluated the inverse at 
the guessed asymptotic equivalent for f^^ e~^^^^dx. The addition of 
the term elogt in c{t) is only to obtain (7.6). 

We now proceed in checking all the assumptions needed to apply 
Theorem 5.1. We already chose a candidate for the dominating 
manifold, 

VA,=tVA^=t{xe dAi : I{x) = I{Ai) } . 

We postpone the check of (5.1)-(5.2) to the end of the proof since it 
requires some discussion. 
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To describe Aj.^^, let 

= {j3 G A^(^^) : taAp) ^ c{t)/e } . 

This choice ensures that 

At,M = {P^ ^I{At) ■ TAtip) ^ c{t) } = tAi^t . 

Thus, wc can look at ^ through its rescaled version Ai^f 

We check (5.3) through a rescaling. Let tq be in A^ and define 
tp = TTAtitq)- The point p is in Vai- Lemma 7.2 implies 

TAt (tq) - TAt itp) = t"' (tAi (q) - Ta^ (p)) • 

Since q belongs to Ai^t, we have TAi{q) ^ c{t)/t"] in particular, TAi{q) 
converges to as t tends to infinity. Since ta^ is continuous on tt^^ (p) 
with a strict minimum on "Dai, we have — p| = o(l) uniformly 
in q belonging to Ai^t — recall p = iTAiiq)- then follows from 
Proposition 4. LI that 

TA,{q) - taAp) = I \mP)\ (GA,b)exp;i(g) ,exp;i(g))(l + o(l)) 

uniformly in g G Consequently, 

TA,{tq) - TAAtp) = Y \BI{p)\ (GAib)exp;i((?) ,exp-n(/)>(l + o(l)) 

as t tends to infinity, uniformly for tq in Af.^j^. This proves (5.3) here. 
Notice that since (expp)*(0) = Id, we also have the approximation 

TA, (q) - TA, (p) = \ |D/(p) I {Ga, (p) (g - p) , (g - P)) (1 + 0(1)) , 

as q converges to which may look more familiar. 

To verify (5.5), Lemma 7.1.2 shows that X^^{tp) = t°^x^^{p)- Thus, 
it suffices to prove that inf^^^^ ^ ^P) uniformly bounded below 
by some positive number for t large enough. If q belongs to Ai^t, write 
q = exp^^ {q){V'v) lor V = dist(g, vryij (g^)) and a unit vector v. As we 
have seen, rj converges to uniformly over q in Ai^t as t tends to 
infinity. Therefore, (7.5) implies that for t large enough, Xai(') ^ ^ 
over Ai^f. Thus, (5.5) holds. 

Our choice of CAt,M = c{t) ensures that (5.6) holds as well. 



84 



Chapter 7. Homothetic sets, homogeneous / and Laplace's method 



To check (5.7) is not much more complicated. Define 

We first notice that for any p in A/(^^) and s positive, the definition 
of i^max in Proposition 4.2.4 and homogeneity of / imply 

Kras^itp, S) ^ sup I \J^J^tq)\-^ • 1 ^ I = ^ • 

Identifying TpA/^p^ and Ttph.i{ip^, the equality expjp(it;) = texpp(w/t) 
holds. It gives, 

io,M(ip) = sup I s : inf tmi ( exp^p(sf )) ^ c(t) | 



TfpT^^ (tp) ^ 



= tsup<^ s : inf TA^ (exp„(si;)) ^ c{t)lf \ . 

Again, as c{t)/t'^ converges to as i tends to infinity, the requirement 
( expp(sL')) ^ c{t)/t'^ forces sv to be o(l) as t tends to infinity. 
Proposition 4.1.1 yields 

to,M{tp) ^ tsupj s : , \\^Kp)\{Gai{p)sv ,sv) ^ | 

for t large enough. Since Gai{p) does not have null eigenvalues on 
Tp'Kj^{p) thanks to (7.2), to^M(tp) is at most 



tsup{s:s' inf hBI{p)\{GAMv ,v) ^ ^} 



as i tends to infinity. Consequently, 

sup J K^a.{tp,to,M{tp))to,M{tp) = t-^/^v/cW 0(1) = o(l) 

as t tends to infinity, and (5.7) holds. 

Following Remark 5.2, we will not check (5.8) but (5.16) instead. 
Adding subscripts to distinguish on which manifold we are working, 
we have 
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for all s in [O, eA{tp,v)') and all v in the unit sphere of TtpTT^^{tp). 
This unit sphere can be identified with that of TpTr^^{p). As we have 
seen, if exp^p^sv) belongs to A^^j^, then t^~"'/'^s/f = o(l) for any 
positive T], as t tends to infinity, and uniformly in p belonging to Va^ ■ 
It follows from compactness of Va^ and the classical expansion for 
A -1 {s, v) as s tends to — see, e.g., Chavel (1996) — that — recall 

a > 1! — 

A-4itp)i^^^) = '^dT.,.-4(tp)+0{syt') = .Id^^^,-i(,^) +0(1) (7.7) 

as t tends to infinity, and uniformly for p in "Dai- This implies (5.16). 

Assumption (5.9) is easy to check. Since p belongs to Vai and q to 
Ai,t(^T^Al(.P)^ ^^-vs 

\DI{tp)\ _ \DI{p)\ _ \DI{tp)\ 
\DI{tq)\ \BI{q)\ |D/(j, + o(l)) | 

where the o(l)-term is uniform in q in the given range and p in Vai- 
Assumption (5.10) is trivial since 

mtp)\^ iD/(p)p 

by homogeneity of /. 

Since tai{-) vanishes on "Dai here, (5.11) holds as well as (5.13). 

We check (5.12) by rescaling. Indeed, for q in or equivalently, 
tq in A^^M, we have 

Identifying TtqA^ j^ and TqAi^t, we also have 

'^AtAiq){v) = i7rAi,*(g)(t;A) . 

But |g — 7r^^((/)| tends to uniformly in Ai^t as t tends to infinity. 
Since the differential iTAi,*{'^Ai{q)) is the orthogonal projection onto 
TpVAi — this comes form the fact that ttai {p) = P for all p in Va^ , 
and that T^Alip) orthogonal to T>Ai — (5.12) follows. 

We are left to check that in (5.1), P^t is indeed a dominating 
manifold for the set AtCiT J(^A)+CA^,M and that (5.2) holds. If /c = 0, i.e., 
Vai is made of a finite number of points, (5.1) is clear. Assumption 
(5.2) is then checked by rescaling, using the fact that Ai^t shrinks 
around Va^- Assume 1. Define 

^ = Atn{ i;{exp^{u), s) : x e Va, , u e TpKj^AtpTpVA, , s ^ } . 
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(k] (k) 

Then, is a dominating manifold for = tA\ ' . From what 

we have done, we can apply Theorem 5.1 to obtain an asymptotic 
approximation for /.(fc) e^^^^^dx, and the result is nothing but the 

statement of Theorem 7.1. Instead of checking (5.1), it now suffices 
to prove that 



^(^)dar = o( / e-^(^)dx) as t 



oo , 



To apply Theorem 5.1 to evaluate the integral in the left hand side 
of the above inequality, we need to find a dominating manifold for 
At \ Ai'^K Pick V (fe) to be dVA^- It is a dominating manifold, of 
dimension at most — 1 for the set 

Af^ = {At \ )) fl { i;{x, s):xe W^^^^,,, } . 

Now, it is possible that At \ A^''^ or ^^'^"^^ do not have smooth 
boundaries. But since dAi is smooth, the set Ai'''^^ can be included 

(k—l)> 

in a set Aj. ' say, with smooth boundary, and for which W (k) 

Ai\A^ 

is again a dominating manifold. This is done in parameterizing 
r^(fc-i)(p) for p in tV^^^^ik) so that T^(fc-i)/ and T^(fe-i) coincide on 

the projection of A^'' on ^^i[At) fo^' instance, and extending r^(fe-i) 
into a differentiable function on Aj^^At)- This way, we need to estimate 
the integral over A^^^^^' , for which we have a {k — l)-dimensional 
candidate for a dominating manifold. Iterating this process, we go 
down to a dimensional dominating manifold, apply Theorem 5.1 
in this case, and obtain the order of all the terms with dominating 
manifold of dimension between and k — 1. In particular, this implies 



and this proves Theorem 7.1. ■ 

7.3. REMARK. It is essential to notice the following. In checking 

(5.3), (5.9), (5.12), the only feature we used besides homogeneity of 
/ is that a point q in Ai^t converges to TTAiiq) as t tends to infinity, 
uniformly in Ai^f For (5.7) and (5.8) — actually (5.16) — we used 
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slightly more, namely that |g — 7ryi^(g)| = o{t~^^^) in order to obtain 
(7.7). The conclusion is that if / is homogeneous, all that we need to 
do to check the assumptions is to check that A^^j^/t concentrates to 
T>At/t at the rate o(t~^/^). So, the work done in the proof of Theorem 
7.1 may save us some effort in other applications. 

7.4. REMARK. In many applications, the set Ai is of the form 
Ai = [x : g{x) ^ } for some smooth — twice continuously dif- 
ferentiable — function g, and I{A) is achieved at boundary points x 
such that g{x) = 0. In such cases, assumption (7.5) can be simplified 
into an analytical condition. To see this, define q = expp(r]v) and 
r = 4j{q,TAi{q)). Since 

thanks to Lemma 3.1.1, we have 

9{i^{q, TA^{q) + s))= gir) + s(pg{r), ^^W) + ^(s') 

= s{Dg,^){r) + 0{s'). 

For this expression to be nonnegative for s nonnegative, it is enough 
to have 

{Dg{r),DI{r))>0. 
Since / and g are smooth and q is close to p for small r], the condition 

inf {Bgip),BI{p))>0 

is sufficient for (7.5) to hold. Since a point p in "Dai minimizes / 
on Ai, the vectors J^g{p) and D/(p) arc positively proportional. The 
condition {Dg{p),DI{p)) positive holds provided Dg{p) is nonzero — 
notice that D/ does not vanish for / is strictly convex. In other words, 
(7.5) is fulfilled as soon as g has no critical points on T>Ai ■ 

Applying Corollary 5.2, we can obtain results on conditional 
distributions. The following statement asserts that the probabil- 
ity measure with density propositional to 1^^ (x)e~^(*^) converges 
weakly* as t tends to infinity to the probability measure abso- 
lutely continuous with respect to Mva > ^-^d density proportional 
to |D/|-('^-'=+i)/2(detG^J-V2. 
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7.5. THEOREM. Under the assumptions of Theorem 7.1, if B is a 
set of continuity of Mva^ j then 



Proof. Wc just need to check (5.18)-(5.20). Here, we consider 
^At = t- The measure in (5.18) rcan be rewritten as 



|D7|^(detGAi(g))^ ^ -^^1 |D/|^(detG^,(g))^ 

and does not depend on t. 
Since 



XA,\mtp)\ t-\^i{p)\ 

assumption (5.19) is satisfied. 

To check (5.20), use rescaHng to obtain 

sup { ^^5^ : tq G 7r^l{tp) , TA,{tq) ^ c{t) } 

= sup {|g - p| : qe TT^lip) , taM ^ c(i)/i" } . 

Since A^^t shrinks around I'ai, assumption (5.20) holds true. Apply- 
ing Corollary 5.2 yields the conclusion. ■ 

Another way to formulate Theorem 7.5 is to say that the probability 
measures with density proportional to lAi(a;)e^^(*^) converge weakly* 
to the one with density proportional to \DI\^'^~^~^^^/'^{detGA'i)~^^'^ 
with respect to the Riemannian measure on Va^ ■ 

Notes 

The notes of chapter 1 contain references on Laplace's method. Also 
very much related to this chapter is the work of Breitung (1994) in 
a Gaussian setting. Theorem 5.1 is related to Hwang (1980). The 
Laplace method in dimension larger than one with a dominating 
manifold of minimizing points is developed in Barbe and Broniatowski 
(200?) , motivated by large deviation theory. 



8. Quadratic forms 
of random vectors 



In this chapter, wc ilhistrate the use of Theorems 5.1 and 7.1 to 
deal with the following question. Consider a random vector X = 
(Xi, . . . , X^) in M"', and a real d x d matrix C = {Cij)-^^^- What 
is the decay of P{ {CX, X) ^ t} as t tends to infinity? 

Of course, this decay depends on the distribution of X as well as on 
the matrix C. We will deal with two different types of distributions: 
symmetric WeibuU- and Student-like. The Weibull-like tail will be 
handled through application of Theorem 7.1, while the Student-like 
one will be handled by a change of variable technique and Theorem 
5.1. 

8.1. An example with light tail distribution. 

Consider a random vector X = {Xi, . . . ,Xd) in W^, having density 
/ = for some convex function 7 on M*^. Assume moreover that I 
is a-positively homogeneous. Writing 

At = {xeR'^ : {Cx, x)^t} = VtAi , 

we see that 

P{ {CX,X) >t]= ( e-^(=^)dx, 
J At 

and Theorem 7.1 is relevant here. 

To be more specific, assume that the components X^ of X are 
independent and identically distributed, all with density 

Wa{x) = exp , a; e M , a > 1 . 

21(l/aj \ a / 

To apply Theorem 7.1 we need to describe the points of dAi at which 

I{x) = - V kil" 

Q! — 
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is minimum. Surprisingly, this problem seems quite difficult, and I 
have not been able to solve it in general. The result will rely on the 
following conjecture. 

8.1.1. CONJECTURE. If a 2 and C + has no vanishing 

eigenvalue, then X]i<j<(i admits a finite number of minima in 
Ai; moreover detG^i is not null at these minima. 

Hence, if this conjecture is indeed true, T>a^ is of dimension A; = 
when a / 2 and C + C"^ has no degeneracy. 

The application of Theorem 7.1 is then trivial. For a 7^ 2, we obtain 

as t tends to infinity — recall that At = ViAi here and not tAi as in 
chapter 6 — where 

ci = (27r)^ 5] |D/(x)|-(<^+i)/2(det(S'Aj-'/^ 
xeVA^ 

The term in ci can be made more explicit. Indeed, 



l<i<d 



for a > 1. Moreover, the remark following the statement of Theorem 

7.1 asserts that Gai is obtained by the difference of the fundamental 
form of ^i(Ai) dAi. Since D^/(x) = (a — l)diag(|a;i|"~^), we 
have 



nA,(^i)(^) = Proj^.A,(^,)^7(^diag(|a;ir ') 
while 

For a = 2, the calculation can be done explicitly. Let A be the 

largest eigenvalue of C+C'^ , and assume that A is positive — otherwise 
{Cx, x) is nonpositive for any x and P{ {CX, X) ^ t} is null for any 
positive t. Let 

H = {x: {C + C'^)x = \x} 
be the eigenspace associated to the largest eigenvalue A. 
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8.1.2. THEOREM. Let k be dimH — 1 and M be the compression of 
Id - A"^(C + C'^) to H^. For a = 2 and X positive, 

<^^- ^> * " ~ A(>-.)/^r(4)(detM)V. 

as t tends to infinity. 

Proof. Assumptions (7.1), (7.2) and (7.3) hold. To check (7.4), we 
only need to calculate . We claim that is the sphere of radius 
\/2/X centered at the origin in H. Indeed, if {Cx, x) = 1 then 

2 = {{C + C'^)x,x) ^X\xf. 

So |a;| ^ y^2jX. On the other hand, if {C+C^)x = Xx and |xp = 2/A, 
then {{C + C'^)x,x) = X\x\'^ = 2, and then {Cx,x) = 1. 
Applying Theorem 7.1, we obtain, 

/ e-N'/^dx ~ cie-*^(^^)t^ ast^oo, 

JAt 

and I{Ai) = 1/X from the preceding argument. To calculate the 
constant ci, notice that in our case, D/ = Id. Thus, |D/(x)| = \/2/X 
on Vai ■ 

To calculate Gai, observe that T^A^^^^ = {x}"*", for the level lines 
of / are spheres. Thus, for x in Vai , the second fundamental form of 
^i{Ai) at X is 

^ ^ P'^oj{a;}^IdL ,x fx . 

"^""nA^^^) = ^^f^\ = V 2 ^'"JW^^'^lw^ • 

On the other hand, the second fundamental form of dAi at some x in 
Vai is 

Proj{,|x(C + CT)| 1 , TM 

na..(.) = |(^^^T),| = ^Proj,.,.(C^ + C-)|^^^. . 

Clearly, since Vai is a sphere, its tangent space at a; is { a; }-*- fl It 
follows that for x belonging to T>Ai , 

T^Ai^A,)envA, = {x}^e{{x}^nH) = {x}^eH = H^ 
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since a; is in as well. Thus, for x in Va-^ , 

{C + C^) 



Gai (x) = Proj 



X 



This matrix does not depend on x. Therefore, putting all the pieces 
together, 

(2^)'^Vol(S'//(0, ^/2/X)) 
(2/A)^(A/2)^det(Proj,x(ld- i^)|^J^ ' 

Using the classical fact 

Vol(5n_i) = , 

we obtain 

(27r)'^/2 



Cl 



A(^-i)/2r(^)det((ld-(^^) 



N 1/2 



The result follows after dividing ci by (27r)''/^, the normalizing factor 
of the d-dimensional standard normal density. ■ 

REMARK. It is interesting to notice the discontinuity in the poly- 
nomial term in t in Theorem 8.1.2. In (8.1.1), this term has degree 

— {a — 2) J — J, which is strictly less than —1/2 for a ^ 2, and equals 

— 1/2 for a = 2. For a = 2, Theorem 8.1.2 gives a polynomial term of 
degree at most since k is at least 1 . Notice also that the map C ^ k 
is not continuous for any standard topology on the set of matrices. 

The determinant of M involved in Theorem 8.1.2 can be given more 
explicitly as a function of the matrix C . Write Ai ^ • • • ^ for the 
spectrum of C + C"^ . Since the dimension of if is /c + 1, we have 
\d-k = ^d-k+i = • • • = Arf. Diagonalizing C + C'^ and noticing that 
H-^ is invariant under Id + A^^(C + C^), we have 



detM= n (l-^)- 



Applying Theorem 7.5, we obtain also the following. 
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8.1.3. PROPOSITION. For a = 2 and A positive, the conditional 
distribution of X/\/i given {CX,X) ^ t converges weakly* to the 
uniform distribution over the sphere centered at the origin of radius 
^yi/X ofH. 

Proof. Notice that Mt>a^ proportional to the uniform distribution 

on Sh{0, and that |D/| as weh as det Gai are constant on ■ 

The result follows from the proof of Theorem 8.1.2. and Theorem 7.5. 
■ 

8.2. An example with heavy tail distribution. 

In this section, we consider a random vector X = {Xi, . . . , X^) in M*^, 
with independent and identically distributed components, all having 
a Student-like distribution with parameter a. Thus, Xi has a density, 
and there exists a constant Kg^a such that 

P{X,^^x]^P{X,^x}^^^ 

as X tends to infinity. Define 

ylt = { z G : (C.T, ,t) > t } = ^tAi . 

Writing Sq,(-) for the density of a single Xj, the density of the vector 
X, given by ni<j<d '^^(^i)- is not log-concave. It is not even 
specified at all, except by an asymptotic equivalent! Thus, we cannot 
use Theorem 5.1 in a straightforward way to approximate 

P{ {CX, X);^t}= I Y[ Sa{Xi)dXi . 

However, as pointed out in the introduction, chapter 1, we can make 

a change of variables, and then try to use Theorem 5.1. This will 
require all the power of Theorem 5.1, in particular the freedom on the 
set A that is allowed. 

To state our first result, define 

Ji = {j:C,,,>0}. 

8.2.1. THEOREM. Let X be a d- dimensional random vector with 
independent and identically distributed components having a Student- 
like distribution. Let C be a d x d matrix. Lf Ji is not empty, then 

P{ {CX, X)^t}^ ast^^. 
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Proof. Let us first make a change of variable so that we will be 
able to apply Theorem 5.1. 

Let Y = (Yi, . . . ,Y(i) be a random vector with centered normal 
distribution, with identity covariance matrix. Its density, 



_\y\ 



is log-concave. Let us write 



/y 
-ex 



-oo \/27r 

the normal cumulative distribution function. Similarly, denote by 



/y 
Sa{u)du 
-oo 



the Student-like cumulative distribution function of each individual 
Xi. Writing 

Sai'^) = inf { y : Sa{y) > u} 

for the inverse function of Sa and analogously for the inverse 
function of we see that S^o $(1^) has the same distribution as Xj. 

NOTATION. Let us agree that a function g defined on R is extended 
componentwise to R'^, writing g{xi, . . . , xa) for {g{xi), . . . , g{xd)) . 

It follows that X has the same distribution as (CS'^o ^iY) , S'^ o 
$(y)). In other words, defining 

Bt = {yeM.'': {CS^^o ^y) , S^^o > i } = $^0 Sa{At) , 

we have 

P{{CX,X)^t}= / n ^aix^)dx, = / -^^dy. (8.2.1) 

Since 5*^0 $ is continuous and defined on the whole real line, we see 
that for any positive M and any t large enough, the ball centered at 
the origin and of radius M does not intersect Bf. Thus, Bt moves to 
infinity as t tends to infinity, and the right hand side of (8.2.1) is the 
integral of a log-concave function over a set moving to infinity as t 
tends to infinity. We can try to apply Theorem 5.1. 
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It should be noticed that we could make a change of variable leading 
to a different distribution than the standard Gaussian one. However, 
this one is rather convenient since its level sets and their geodesies are 
known explicitly. 

The disadvantage of the change of variable is of course that the set 
Bf is more complicated than Af. Nevertheless, whatever information is 
needed on Bt can be first read on At, and then pulled back to Bf. This 
fact is illustrated by Proposition 8.2.4 bellow, where we will calculate 
This change of variable technique works mainly because ^*~oSa 
has an explicit and simple asymptotic equivalent. 

To apply Theorem 5.1, let us define 

m = ^ + log(27r)'^/^ , 

that is minus the logarithm of the Gaussian density. The function I 
is convex. 

We will make use of the following elementary result in asymptotic 
analysis, whose proof can be found in appendix 1, 

{^'-oSa{x)f = 2alogx-loglogx-21og(X,,„a°/^)-21og(2V^)+o(l) 

as X tends to infinity. It implies 

$*~o Sa{x) = \/ 2a log X + o(l) as a; ^ oo . 

It is also convenient to introduce the canonical basis ei, . . . , of M''. 

For any J in Ji and e in { — 1, 1 }, the point Pe,j,t = ^\/t/Cjj ej belongs 
to dAt- Thus, Qej.t = SaiPe,j,t) bclougs to OBt- The following 
lemma gives a parameterization of dAf and dBt near Pe,j,t and qe,j,t- 
This describes these boundaries locally. 

8.2.2. LEMMA. The tangent space of the boundary dAt at p = p^j^t 
is { (C + C'^)p I"*". Near p, the boundary of At can be parametrized as 



e,- + V 



for V in TpdAt, and \v\ = o{Vt) as t tends to infinity. 

The boundary dBt near q — qe.j,t = ^*~o Sa{p) can be parametrized 

as 



t loglog\/t log(A;,,„Q«/22^) 

m=qe,j,t{v)=e[. alog- 



+ 



Cjj 2\/a logi v^ologi 

^ ^ ^ l<i<d 
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for V in TpdAf and \v\ = o{-\/t) as t tends to infinity. 

Proof. The assertion on the tangent space of dAt at p is plain since 
the differential of the map x {Cx, x) at p is (C + C'^)p. Near the 
point ^JY[(\i^ we can parameterize dA\ by its tangent plane. This 
leads to the following parameterization of dA^. Let h{v) be such that 



for all V in TpdAt with \v\ not too large. This inclusion becomes 

t = {Cp{v),p{v)) = t{l + h{v)f + {Cv,v) . (8.2.2) 

An approximation of h follows either by working out an asymptotic 
expansion for /t(-) or using the following argument. For \v\ = o{\/i), 
(8.2.2) implies h{v) = o(l) as t tends to infinity. Rewriting (8.2.2) as 
the quadratic equation in h, 

= th{vf + 2th{v) + {Cv,v) , 

we obtain 

{Cv, V) \ 1/2 



M.) = -i + (i-^)' 



as i ^ oo . 



This gives the asymptotic expansion for p{v). 

Wc then pull back the expression of p{v) to parameterize dBt by 
q{v) = Sa{p{v)). Notice first that 



log U ^ (1 + 0(t->|2)) ) - ^ log ^ + 0{t-'\v\') . 

\ y 3}J / 3i3 

In the range |f| = o(\/t), the asymptotic expansion for in 
Lemma A. 1.5 gives 



r r log log log(i^,,„a"/22V^) 

' cx log -1^=^= — ' 

Cjj 2-v/aIogI V« log t 

+ o(log *)-'/'. 
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On the other hand, for j, 

e{Q{v),ei) = $*"o Sa{vi) . 

This proves Lemma 8.2.2. ■ 

It is somewhat important for what follows to have some intuition on 
the shape of dBt near qe,j,t- This is precisely what the last assertion of 
Lemma 8.2.2 gives us. Recall that p^j^t is collinear to ej. As v varies 
in Tp^,,,Mi = {{C + CT)e, the' term Zl^^^d;^^J ° S^{v,)e, 
in the expression of q^j^t{v) varies too. If the iij's were allowed to 
vary independently, then X^i^i^^.j^j Sa{vi)ei would describe the 
hyperplane { ej }-*-, and dBf would be a hyperplane perpendicular to 
Bj, passing through qe,j,t- This is not quite the case of course, but 
almost, provided we look at the right scale. This is the meaning of 
the next claim. 

8.2.3. CLAIM. Fort large enough and j in Ji, the set 

{ Yl Sa{vi)ei ■.v±{C + C^)ej , |^;| = o{Vt) } 



is contained in 



I WiCi : \w\ ^ ^-s/alogt I . 



Proof. One may argue as follows. Notice first that { (C + C'^)ej }-*- 
does not contains Ej. Indeed, if this were the case, we would have 
Cj^j = {{C + C'^)ej ,ej) = 0, contradicting the fact that j belongs 
to Ji. Consequently, as v varies in { (C + C'^Jej}^, the vector 
^i<i<d-i^j Sa{vi)ei describes the space spanned by the Cj's for 
1 ^i ^ d and i ^ j- Finally, if w is orthogonal to Cj and of norm less 
than (l/2)-v/alogi, then w = Xli^i^d iT^j <I>*~o Sa{vi)ei for some v in 
{C + C'^)ej. Furthermore, o Sa{vi)'^ ^ flogt. From Lemma 
A. 1.5, we then infer \vi\ ^ t^^^ for t large enough. The relation 
V ± {C + C'^)ej forces then \vj\ ^ 0{t^/^), and so = o{^/t). 
This proves our claim. ■ 
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We can now locate the interesting minima of / over dBt. They 
will provide a good guess for a dominating manifold, as well as an 
estimation of I{Bt). 

8.2.4. PROPOSITION. Assume that Ji is nonempty. If y belongs 
to dBt and I{y) ^ I{Bt) + 0(1) as t tends to infinity, then y is in 
a 0{1) -neighborhood of a points q^j^t for some j in Ji and some e in 
{—1,1}. Moreover, as t tends to infinity, 

HQe,j,t) = I logt - ^ log log ^ft - log(i^,,aa"/'2V^) 

+ log(27r)'^/2-|logC,-, + o(l). 

Proof. By the very definition of / and Bf, 

= inf { |y|V2 : (CS^o $(y) , S^o ^ t } 

=^ inf { ^*-o Sa{\Viuif : (Cm , = 1 , A ^ 1 } 

J- inf { S^{^rtuif ■.{Cu,u) = l], 

the second equality coming from the change of variable j/j = o 
Sa{^y/tui); the last one comes from the fact that the function A G 
[0, oo) Saiy^Viuif' is increasing for t large enough and Ui 

fixed. 

If s is positive and such that \/ts tends to infinity and logs/ log i 
tends to as t tends to infinity, the asymptotic expansion for ($*~o 
ill Lemma A. 1.5. shows that 

^'-o Sa{^ftsf = a\ogt{l + o{l)) ast^oo. (8.2.3) 

If {Cu,u) = 1, and r of the Uj's, say ui, . . . ,Ur, are of order larger 
than 1/logt, i.e. mini^j^^ \ui\ ^ 1/logt as t tends to infinity, then 
(8.2.3) yields 

Y SaiViuif ^ Y ^^°Sa{Viuif -^ralogt (8.2.4) 
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as t tends to infinity. Hence, to minimize the left hand side of (8.2.4), 
we should have r as small as possible. But r must be at least 1, for 
{Cu,u) = 1. Moreover r = 1 can be achieved by considering j in 
Ji and u = eej/y/Cjj for some e in { — 1, 1 }. This leads us to look 
at the function / near q^j^t = Soi{e\/tej / \/Cjli) . Furthermore, 
if /(<I>^o Sa{\/iu)) is minimal, u must be on the boundary of a 
0(logt)~^-neighborhood of ^e-jl^/Cjj for some e in {—1,1} and j 
in Ji. Consequently, y/tu is in dAt and in an 0(v^/logt) = o(v^)- 
neighborhood of Pe,j,t- Therefore, when studying such a point, we can 
use the parameterization given in Lemma 8.2.2. This also leads us to 
look at the function /(^^^^^(t;)) for v in Tp^.^dAt and \v\ = o(\/i). 

Write Vi = {v, Cj) for the components of the vector v belonging to 
^Pej.t^^t- Using Lemma 8.2.2, we obtain 

l{Qe,j,ti^)) = ^ log - ^ log log Vi - log{Ks,aa^^'2V^) 

+ ^ E ^^o5a(i;i)' + log(27r)'^/2 + o(l) (8.2.5) 

as t tends to infinity, and uniformly in 1^1 = o(\/t). Therefore, up to 
o(l) as t tends to infinity, the function v v-^ ^{Qe,j,t{v)) is minimum at 
0, and its minimum value is I{qej^t) as claimed. ■ 

Notice that the proof of Proposition 8.2.4 gives actually a little 
bit more, and this will be useful. Indeed, if x is in dBf and 
I{x) = I{Bt) + o(loglogt), then (8.2.3)-(8.2.4) and Lemma A.1.5 
show that a; is in a o(loglogi)-neighborhood of some qe,j,t- Indeed, we 
must have 

max \<^^oSa{vi)\ = 0(loglogi)^/2 . 

In view of Proposition 8.2.4 and its proof, we can start to apply 
Theorem 5.1 in calculating a few terms of the asymptotic formula. 
Indeed, define 

71 = max Cjj. 

We have immediately 

I{Bt) = I log t - ^ log log Vt - log(i^,,«a"/^2 V^) 

+ log(27r)'^/2-|log7i + o(l) 
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as t tends to infinity. Moreover, a candidate for the dominating 
manifold is 

{qe,j,t:j&Ji,ee{-l,l}}. 

Unfortunately, this choice does not match with our definition of a 
dominating manifold. It is indeed required in the definition that it is 
also a base manifold, and as such belongs to ^i(Bt)- The expression 
I{y) = (|yp/2) + log(27r)<^/2 and Proposition 8.2.4 shows that the 
points qe,j,t, for j in Ji, cannot lie on the same sphere centered at the 
origin. But they almost do! 

Let us denote by pt the radius of the sphere Ajf^^^y The ex- 
pression for / shows that pt = ^/l{Bt) + (ilog(27r). Define r^j^t = 
Pt<le,j,t/\(le,j,t\- We consider the candidate 

T>Bt = {re,j,t : j e Ji, ee {-1,1}} 

for a dominating manifold. It will be clear after Lemma 8.2.7 that 
what we are really doing here is moving the points Qej^t through 
the normal flow, until they reach the level line Aj(^Q^y, this gives 
'^ej,* — somehow unfortunately for the clarity of the argument, but 
luckily for the calculation, this move along the normal flow and the 
Euclidean projection on the sphere coincide when working with the 
normal distribution. 

Since is of dimension 0, its Riemannian measure is a sum of 
point masses, 

jGJi 
ee{-l,l} 

From Proposition 8.2.4 and the above expression for I{Bt), we infer 
that for j in J\ , 

TBt {re,j,t) = fog 7^ + o(l) as i ^ oo . 

Since D/ is the identity function and I{y) = -i^- + log(27r)'^/^, we also 
have 

\BI{re,j,t)\ =Pt = V'^IiBt) + dlog{2Tr) ~ y^alogt as i ^ oo . 
If we can apply Theorem 5.1, we obtain 



8.2. An example with heavy tail distribution 



101 



JBt 



-I{Bt)f2T^\{d-l)/2 exp( TBt{qe,j)) 

,t/. |D/(r,,,,)|(<^+i)/2(detGB,(r.,,,))^/^ 



eG{-l,l} 



K^^a^'-'-^y^ y ^ ^ (8.2.6) 

^ iGJi (detGB,(r,j,t)) 

^€{1,1} 



as t tends to infinity. Wc arc left with ealeulating GBt{fe,j,t) and 
checking the assumptions of Theorem 5.1. In order to calculate 
GBt{fe,j,t), we need to calculate TBt, and ultimately the normal flow. 
This turns out to be particularly easy for the normal distribution. 

8.2.5. LEMMA. For the Gaussian distribution Af{0, Id) onW^, the 
normal flow is given by ^{q, s) = ^1 + q- 

Proof. The level lines Ac are spheres centered at the origin since / is 
a spherical function. Hence, ■il){q, s) moves on a straight line through 
the origin as s varies, and ip{q,s) = a{s)q for some function a(-). We 
obtain a from the equation 

liQ) + « = ^ + log(27r)'^/2 + , ^ ^ ^ |^|2 + log(27r)<^/2 . 



That is, a{s) = ^1 + -^. , 

Since the exponential map on the level line Aj(^Bt) involved in the 
definition of the curvature term G, we first recall its expression in the 
Gaussian case. 

8.2.6. LEMMA. If I is a spherical function, then for q in Ac, we 

have 

(i)T,Ac^{q}^, 

(a) expq(w) = cos (1^)'? + "^^^ ^ ^g^c with 

\w\ ^ ttIoI. 



102 



Chapter 8. Application to quadratic forms of random vectors 



Proof. Since Ac is a sphere centered at the origin, (i) follows. The 
geodesies on Ac are circles of maximal diameter. By cutting Ac along 
the plane determined by q and w, the expression of the maximal circle 
leaving q in the direction v, that is (ii), follows. ■ 

Since we calculated I{qej,t) in Proposition 8.2.4, it is easier to 
calculate r^^ ( exp^^ . ^ (f )) than tb^ ( exp^^ . ^{v)y The following lemma 
will be instrumental in relating these quantities. It is specific to the 
Gaussian situation. Since T^A/^g) = {g}"*", we can identify TqAjf^g-^ 
and TxqAj(^xq) nonzero A. 

8.2.7. LEMMA. For the standard Gaussian distribution, i.e., I{x) = 
|x|V2 + log(27r)''/2, 

(i) for any nonzero q, any positive A and anyw inTqAj^q^ = TxqAj^xq)> 
we have 

expAg(w) = Aexpg(w/A) ; 

(ii) moreover, for any set B, any positive \, and any q in such 
that the line segment between Xq and q does not intersect B, 

rB(Ag) = M!(i-A2)+rMg). 



Proof. (i) follows from Lemma 8.2.6 since 

exp,,(«.) = cos (M) + sm (M) |A,|^ = Aexp, Q . 

To prove (ii), the condition that the segment between \q and q 
does not intersect B, the fact that the normal flow moves along 
straight lines through the origin, and the definition of tb imply 
i^iQiTsiQ)) = i'iMjTBiXq)). The expression of the normal flow in 
Lemma 8.2.5 gives 

The formula for tb{M) follows. ■ 

REMARK. The essence of Lemma 8.2.7 is to relate exp^^^ CIV'S,* 
and tps o exp^. Both maps act on TqAj(^qy We can write one as the 
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other one composed with some transform of Tqkn^qy Lemma 8.2.7 
makes this exphcit in the Gaussian case. 

We can obtain an approximation of TSt provided its argument is not 
too far away from g^j^t for some j in Ji and some e equal to —1 or +1. 

Since wc made everything exphcit up to o(l), our approximation wih 
not be good enough to check (5.3), but perfectly fine to check (5.17) 
— one may try to check (5.3) and hopefully will agree that (5.17) is 
a useful refinement. Hence, we are ready to calculate the curvature 
term 

8.2.8. LEMMA. In the range \w\ = o{^/logt), we have 
TB,{exPr,j,tH) = fB,(exp^^ ._^(w)) + o(l) , 

with 

Consequently, 



Proof. Let us first obtain an approximation for ( exp^^ ^ ^(tu)) . 
Lemma 8.2.2 shows that in the range |ti| = o(\/t), near qej,t, the 
surface dBt parametrized by v q^j^tiv), is given by the equation 
of the hyperplane {x,ej) = l^e.^,*!, up to o(logt)~^/^. Consequently, 
for q = qe,j,t, for w in TgAj(Bt) and s = TBt{e^Pq{w)), using Lemmas 
8.2.5, 8.2.6, we obtain 

|g| +o(logi)-^/2 = (V'(expq(w;),s),ej) = ^'l + |^ cos (^) kl • 
It follows that 



-l/2\ 2 



|g| +o(log^) 
|g|cos(|u;|/|g|) 



\w\ 



+ o(l) as t ^ oo . 
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in the range \w\ = o{\q\) = o(logt)^/^. Prom Lemma 8.2.6, we deduce 
TB, ( exp^^ (w)) = TB, [-^ expg 

Therefore, since TBt{q) = o(l), 

TBt ( exp^^ iti^))- ^Bt {re,j,t) = TBt 



exp 



\w\ 



+ 0(1). 



This is the result, setting 



f^,(exp,^,^,^H) = ^-i-^ + ^. 

The second statement follows since we proved the asymptotic 
equivalence |D/(rgj,t)| ~ ^/alogt. ■ 

Combining Lemma 8.2.8 and result (8.2.6) yields the asymptotic 
equivalence given in Theorem 8.2.1. It remains to check the assump- 
tions of Theorem 5.1. 

Our choice oiDst ^ discrete set ensures that (5.1) holds. 

We now need a candidate for CBt,M- Let c(t) = 2d\og\ogt. From 
Proposition 2.1 and our calculation of I{Bt), we infer that 



L{l{Bt) + c{t)) ^ 



t«/2(logt)° 



- 0(1) = o(r"/2) as t ^ oo . 



Thus, c{t) is a good candidate for cb^.m^ matter what M is. 

From the proof of Proposition 8.2.4 and Lemma 8.2.8, we infer that 
for any t large enough, 



^t^M C { exp^^^,^(i«) : \w\ ^ ^JM\og\ogt\ 

e?;,,,-.*Aj(B^),eG{-l,l},iG Ji}. 



w 



i.2.7) 



Since the level set ^i(Bt) * sphere of radius ^/2I(Btj{l + o(l)) ~ 
V^alogt as t tends to infinity, its radius of injectivity is of order 
Vlogt ^ "v/Ioglogl, and assumption (5.2) holds. 
Assumption (5.4) holds thanks to our choice of c{t). 
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To verify (5.5), let r be a point in ^. The point q = ijj{r, TBt{r)) 
is in the boundary dBt- Since TBt{r) is less than c(t) = 2(iloglogt, 
the proof of Proposition 8.2.4 and (8.2.5) show that q = q^j^tiv) for 
some e in { —1, 1 }, some j in Ji and 

<!>^oSa{Vif ^ Idloglogt. 

Consequently, Lemma A.1.5 shows that \vi\ ^ (logt)^*^/", for 1 ^ i ^ 
d with i 7^ j. Since u is in { (C + 0^^)61 }^, and this hyperplane does 
not contain e^, we have = 0(logi)^'^/°. Notice that 



tp{r,TBt{r) + s) = ip{q,s) = 




and that |g| ^ \r\ = pt tends to infinity with t. To prove that ^ pj 
for instance — which is more than enough to guarantee (5.5) — it 
suffices to prove that \q is in Bt for any 1 ^ A ^ 2. This is plain from 
Lemma 8.2.2 and Claim 8.2.3. 
Assumption (5.6) is plain. 

To check (5.7), notice that for q in Psj, the inequality to^Miq) ^ 
v^5dlogTogt holds thanks to (8.2.7). Furthermore, as D^/(g)/|D/(g)| 
equals Id/|g|, 

K^^iqM ^ sup { \qr : 1 ^ Aj^b,) } - ^ - ^ 

as t tends to infinity. Therefore, (5.7) holds. 

Assumption (5.8) holds as well since tTj^^{p) is essentially a finite 
union of spherical caps, and the Ricci curvature of a sphere is positive. 

Assumption (5.9) is trivially satisfied since Ac is a sphere. Thus, 
two points in Ac have equal norms. 

It is no harder to verify (5.10), since 

||D^/(;j)|| _ pc^ _ 1 

WW' WW 

To check assumption (5.11), again, we have ||D^/|| = ||Id|| = 1. 
Moreover, if q belongs to B^ f^^ C A^^Bt) ^^'^ ^ is nonnegative. 



D/(V«((?))|' ^ |D/(g)|2~2/(i?t)~alogt. 
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Consequently, for q in B^^m > that is tb^ {q) is less than c{t) , and for t 
large enough, 



L 



-s.(^)p327^ , ... ^ 2c{t) 4dloglogt 



Since T>Bt is discrete, (5.12) holds automatically. 

In conclusion, all the assumptions of Theorem 5.1 are satisfied. This 
proves Theorem 8.2.1. ■ 

From the work done, we can easily infer the following conditional 
result. 

8.2.9. THEOREM. Under the assumption of Theorem 8.2.1, if Ji is 
nonempty, the conditional distribution of the vector 

ilogVt)-\sign{Xi)log\Xi\)^^.^^ 

given {CX,X) ^ t converges weakly* to 



5^ (-1O1/2 



Proof. In order to apply Corollary 5.3, let us check its assumptions. 
Set = Pt- The numerator of the measure involved in (5.18) is 

1/2 • 



E E- 

|D/(r,,,, ,) I (-^^D/^ (det Gb, {re,j,t)) 



We already calculated 

TBt {re,j,t) = I{qe,j,t) " I{Bt) = - log — + o(l) as i ^ oo . 

Moreover, as r^j^t is in the sphere ^i{Bt) and D7 = Id, we have 

\DI{rej,t)\ = Pt- Lemma 8.2.8 gives the value of det GBt{i^e,j,t) ■ 
Moreover, r^j^t/pt = Qe,j,t/\Qe,j,t\ = ecj. Consequently, the measure 
in (5.18) is " 

E.g{-i.i}E,.j,g;:f(l + o(l))^.e, 
2E,e..C-f(l + o(l)) 
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It certainly converges weakly* to 



V = 



a/2 

hi 



Assumption (5.19) is trivial to verify since Bj.^^ is on the sphere ^i(^Bt) 
of radius pt- Consequently, (5.19) becomes 

2d\og\ogt 2(iloglogt 
hm 7^ = hm = U . 

t^oo t^co a\ogt 

To check (5.20) is as simple. The inclusion (8.2.7) shows that if q 
belongs to B_^^j^; then the Riemannian distance on ^i(Bt) between q 
and TTBt is at most -v/MlogTogi. Since ^i{Bt) is a sphere of radius pt, 
simple trigonometry shows that Iq — T^Btiq)] ^ Pt sin(-^5d log log t/pt ) . 
Since pt is of order -y/a \ogt as t tends to infinity, assumption (5.20) 
is fulfilled. 



Applying Corollary 5.2, the distribution of Y/^/aTogi given Y G 
Bt converges weakly* to u. In other words, the distribution of 
Scl{X)/^/a\ogi given {CX, X) ^ t converges weakly* to i/. 

To rephrase this conclusion directly on X, we can use the Skorokhod 
(1956) representation theorem. It implies the existence of a random 
variable Yt having the same distribution as Y given Y e Bt, and 
a random variable Y^o having distribution u such that Yt/^/a\ogt 
converges almost surely to Y^o. Thus, X given {CX,X) ^ t has the 
same distribution as 

o *(Ft V^d^) = 5^ o V^d^(l + 0(1))) . 

Since o $ is ultimately sign preserving on R and Lemma A. 1.6 
yields 

log 



57o$(ee,V^bi7(l + o(l)) 

= ej \ogS^o^(^y/a\ogt{l + o(l))) 



logt , 



= e,-^(l + o(l)). 



the result follows 



A careful sharpening of all the estimates could certainly lead 
to more precise information on the conditional distribution of X 
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given {CX,X) ^ t, and even an asymptotic expansion of this 
conditional distribution. We will not pursue in that direction for 
mainly two reasons: First, such calculation would be quite specific to 
this example. Second, we will see hereafter in this section that the kind 
of degeneracy at the limit — the limiting distribution is concentrated 
on a finite number of points — is not due to a bad rescaling but only 
to the fact that J\ is nonempty. 

It may happen that all the diagonal coefficients of the matrix C 
are nonpositive, that is maxi^j^rfCj^j ^ 0. What is the analogue of 
Theorem 8.2.1 then? When 71 was positive, we could essentially set 
all the UiS but one equal to in order to optimize /(^'^o Sa{-\/tu)) 
— see the proof of Proposition 8.2.4 and inequality (8.2.4). When 
7i is negative, we need to take at least two components Uj, Uj to be 
nonzero. Setting = y/luiCi + y/tujej, the equation G dAt 
becomes 

1 = Ci,i + UiUj{Cij + Cj^i) + UjCjj . 
This equation admits a solution in Ui, uj if and only if 
(Cjj + Cj,i)^ — 4Ci^iCjj > . 

Consequently, if 

J2 = { j)--i^J, (Q J + Cj,,f - ACijCj,i > } 

is nonempty, we can indeed consider only two nonzero components Ui, 
Uj with in J2. 

How many components do we need to consider in general? To 
answer this question, it is more convenient to change the notation. 

Let T be the set of all subsets of {1,2 . . . , d} . For a set X = 
{ ii, . . . , ifc } with distinct elements, denote by \1\ = k its cardinality. 
To 1, we associate the subspace Vx = span{ , . . . , e^j, } of dimension 
To the matrix C, we associate 

Ar(C7) = min { |T| :Tg T, 3ueVi, {Cu,u) > 0} 

and 

J{C) = {I eT :3ueVi, {Cu,u) >0, |T| = A^(C) } . 

So, if N{C) = 1, the set J(C) is Ji. The integer N{C) is the smallest 
cardinal of a set I such that the inequation {Cu, u) ^ has a solution 
in Vi\{0}. We exclude some degeneracy, assuming that 

for any J in T of cardinal \I\ < M(C), the matrix C „ „n 

' ' ^ ^' 8.2.8 

IS negative on Vj, 
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that is (Cit, u) is negative for any nonzero vector u in Vx- This typi- 
cally prevents having 71 null, and the analogue when more components 
need to be considered. Notice that this nondegeneracy is typical with 
respect to the matrix C. In particular, the equation {Cu, u) = 1 has 
a solution in any subspace Vx for I in J(C), and has no solution in 
any subspace of the form span{ ea^, ■ ■ ■ , } k < -^(C*)- 

We keep the notation 

At = {xeW^ : {Cx, x) =t} = ViAi . 

The sets 

Mx={meVx:{Cm,m) = l}, I e J{C) , 

are — l)-dimensional submanifolds of and dAi as well. Indeed, 

it suffices to prove that 1 is a regular value of the map x £ Vx 
{Cx. x) . The differential of this map at m is (C + C^)m. If m belongs 
to Mx, 

{{C + C'^)m, m) = 2{Cm, m) = 2^0. 

Consequently, (C + 0^^)171 does not vanish, or, equivalently, the 
differential (C + C^)m is of full rank, and 1 is a regular value. 
The result is then as follows. 

8.2.10. THEOREM. Let X he a d- dimensional random vector with 
independent and identically distributed components with Student-like 

distribution with parameter a. Let C be adxd matrix and N = N{C). 
For X in J{C) and m in Mx, denote G{m) the compression of the 
diagonal matrix X^jgx^i ® ^i/ln^'i 

I to { (C + C^)m } n Vx. Under 

(8.2.8), fora> 2/N, 
as t tends to infinity. 

REMARK. The assumption a > 2/N guarantees that the integral 
over Mx in the equivalence is finite. But we will see that the 
result is true whenever the integral over Mx is finite. It is not 
clear whether a > 2/N is required, though a too small makes 
the integral diverge. This can be seen in Lemma 8.2.18 below. If 
N{C) = 1, Theorem 8.2.10 is exactly Theorem 8.2.1. Clearly, in this 
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case, the term K^^a°'^/H~°'^l'^ / \faN in Theorem 8.2.10 gives the 
term i<'^,„Q'(°-^)/2t-°/2 in Theorem 8.2.1. For N{C) = 1, we have 
J(C) = Ji, provided we identify {i} and i. If X = {i} belongs to 
J(C), then = Me,, and 

Mr = { X e M ei : {Cx, x) = l} = { -C^l'^ei, C^^'^ei } . 
Thus, the Riemannian measure on Mj is 

1,1 * 1,1 * 

Moreover, for m in Mj, the matrix G{m) is the compression of 
Bi (g) Cj/m? to { (C + C'^)m j"*" fl 1^. But, in our case, 

{{C + C^)m}^nVx = $, 

because, if x is in Vj, we have x = sei for some real number s, and 

{{C + CT)m , e,) = ±C-l'\{C + C^)e, , e,) = ±2C;f 7^ 

— the last inequality holds since we assume N{C) = 1 here and i in 
Ji, i.e., { i } is in J(C); we actually did the same work in the proof of 
claim 8.2.3. So, the term det(5(m) has to be omitted, and we obtain 

as in Theorem 8.2.1. 

Proof of Theorem 8.2.10. The proof is actually very similar to that 
of Theorem 8.2.1, except that the dominating manifold will no longer 
be a discrete set, and the parameterizations will be slightly more 
sophisticated. 

As in the proof of Theorem 8.2.1, we denote by Sa the cumulative 
distribution function of Xi . We consider the two sets 

At = {xeM.'^ : {Cx, x)^t} = VtAi , 

and 

Bt = {yeM'' : {CS^o<!>{y),S^o^y))^t}. 

Notice that if X and I' are distinct and in J{C), then Afj docs 
not intersect Mj'. Indeed, if x is in both AIj and Mj', then it is 
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in Vx n Vx> = Vini', and moreover {Cx,x) = 1. This contradicts the 
minimahty of N{C), since I different than I' implies |XnX'| < N{C). 

We first need a technical lemma, saying that whenever Z is in J{C), 
the manifold Mx stays away from any subspace { }-*- with i in 1. 
We denote by rrij = (m, e^) the components of the vector m belonging 
to W^. 

8.2.11. LEMMA. Under the assumption (8.2.8), there exists a 
positive eo such that for all Z in J{C), all m in Mx, and all i in 
Z, the inequality \mi\ ^ eo holds. 

Proof. Searching for a contradiction, assume that there exists Z 
in J(C), some index i in Z and a sequence m(n) in Mx such that 
oo'Tij(n) — 0. Write 

m(n) = mi{n)ei + s{n)v{n) 

where is a unit vector in Vx\{ij = Vx Q e^M, and s(n) is a real 
number. Dropping the index n for notational simplicity, the condition 
m = m(n) belonging to Mx becomes 

1 = {Cm,m) = mf{Cei , e^) + mis{{C + C'^)ei , v) + s'^{Cv ,v) . 

Assumption (8.2.8) guarantees 

sup { {Cv, v) w e Vx\{i} , |?;| = 1 } < . 

Thus, the above quadratic equation in s does not have any solution, 
since its discriminant is mf{{C + C'^)ei, v) — A{mfCi^i — l){Cv, v) = 
A{Cv,v) + o(l), which is negative as n tends to infinity; this is a 
contradiction. ■ 

In order to parameterize the boundaries dAt and dBt, we consider 
the normal bundle of the immersion Mx C dAi, namely, 

Mx= {{m,v) -.me Mx,v e TmdAi Q TmMx } . 

The analogue of the parameterization Pfj^t{v) of dAt in the proof of 
Theorem 8.2.1 is now a map defined on o(v^)-sections of the normal 
bundle A/j. 

It is convenient to introduce 

n—, l"glQg^/^ log(X,,^a"/22V^) 
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8.2.12. LEMMA. Let m be in Mj. The boundary dAt near s/im can 
be parameterized as 

Px,t(m,v) = Vtm{l - ^{Cv,v) + o{l/t)^ + v, 

{m,v) G A/j, \v\ = o{\/i) as t tends to infinity. The boundary dBt 
near <I>^o Sa{Vim) can be parameterized as gx,t("^)^) + o(logt)~-^/^ 
where 

gx,t(m,i;) = > sign(mi) Q(i)H — ]ei+ } $ oSa[Vi)ei 

and in the range \v\ = o{y/i), log|mi| = o{\ogt)^^'^. 

Proof. Since dAt = ^/idAi and dAi is a manifold, there exists a 
function h and some small positive e such that for any l^l ^ e^/i, any 
m in Mi and v in T^dAi Q T^Mj, 

Pi,t{m, v) = Vim{l + h{v)) + v e dAf . 

This equation can be rewritten as 

t{Cm, m) (1 + h{v)) ^ + Vt{l + h{v)) {{C + C^)m , v) + {Cv, v) = t . 

Since {Cm, m) — 1 and ((C + C'^)m, v) = — recall that v belongs 
to TpdAi = { (C + C^)m }^ — we obtain 

= th{vf + 2th{v) + {Cv,v) 

as in the proof of Lemma 8.2.2. Thus, h{v) ~ —{Cv,v)/{2t) as t 
tends to infinity, uniformly in l^l = o{Vi). This gives the asymptotics 
for px^f (m, v) in Lemma 8.2.12. 

We pull back this parameterization to dBt by introducing 

qx,t{'m,v) = $*~o Sa{pi,t{m,v)) . 

Lemma A.L5 implies 

^^osJVimi{l+\vfO{t-'))+Vi) = Q(t)+VZ^i^|M+o(logi)-'/' 

as t tends to infinity, uniformly in the range \v\ = o{Vi), log|mi| = 
o(logt)^/^ and |mj| ^ eo > 0. Since Lemma 8.2.11 ensures that |mj| 
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stays away from 0, the last condition, |mj| ^ eo, may be omitted in 
the statement of Lemma 8.2.12. ■ 

Define 

r./ X N(C) , N(C) , , r 
R{t) = — log t ^ log log Vi 

- N{C) log(i^,,aa"/'2^/^) + log(27r)'^/2 . 

Recall that I{x) = (|xp/2) + log(27r)'^/2. In the range \v\ = o{Vt) and 
log \mi\ = o(logi)^/^, it follows from Lemma 8.2.12 that 

/(,x,(m, v)) = Iy: (QW' + + 0(1)) 



2 

10 

R{t) + a^loglniil + ^ '^^oSaivif 

iei l^i^d (8.2.9) 

i0 



+0(1) 

as t tends to infinity, for all i in X. 
For a set 1 belonging to J{C), define 



7x = min | Y[ l^^il • ^ Mr | • 



iei 

Furthermore, set 

7 = inf (71 :X€ J(C)}. 

The following result locates the points on dBf where the function / is 
nearly minimal. 

8.2.13. LEMMA. We have 

I{Bt) = R{t) + a log7 + o(l) as t ^ oo. 

Moreover, there exists rj in (0, 1/2) such that for any number positive 
Ml, any M2 > Mi/ a, and any t large enough, the set 

dBt n Tj(^Bt)+Mi log log t 
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is included in 

U { gx,t(m, v):me Mj,\m\^ (logt)^^ , 
xeJ(C) 

\v\ ^ _ |Proj^x^;| ^ (logt)^2 I 

where Proj^^ is the projection onto V^. 

Proof. We first prove the second assertion of the lemma. By 
construction, $^ o Sa{d\/t.Ai) — dBf. Let u be a point in dAi. 
Lemma A. 1.5 imphes tliat for any r] in (0, 1/2) and provided t is large 
enough, 

l{^'-oSa{Viu))^a{l-ri) J] l^t-, ^^){\ui\) log {Vi\ui\) . 

Consequently, if 

/($^o SaiViu)) ^ R{t) + Ml log log i + 0(1) , 
we must have, for t large enough, 

(1 + r?)^^alogt ^at{l^i^d: \ui\ ^ t'^ } log(t(V2)-^) . 

Taking r] positive and small enough so that the integer part of 
Y^N{C) is iV(C), the previous inequality yields 

N{C)^^{l^i^d:\ui\^t-'>}. 

Then, since u is in dAi, the minimality of N(C) and Lemma 8.2.11 
implies that we must have 

N{C) = fi{l^i^d: \ui\ ^t'"'}. 

Thus, \/tu is in a 

^(1/2)-,? = o(v^)-neighborhood of v^Uje j(c) 
Therefore, it can be written as ]3i^t(m, v) for some T in J{C), some m 
in Mx and \v\ = o(i(^/2)-j?)_ 

Lemma 8.2.11 ensures that mj ^ eo wherever i is in some set I 
of J{C). Thus, for any positive e and t large enough, Lemma A.1.5 
implies 

l{<P^o Saipx,t){m,v)) 



^l^'^^oSa{Vi\mi\il-e)f 

^ a ^ log {Vi\mi\) - log log {Vi\mi\) + o(l) 

= (^^alogi + aJ^logK|)(l + o(l)) (8.2.10) 
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Therefore, the inequahty 

l(^^^oSa{pi,t{m,v))^ ^ i?(t) + Miloglogt + 0(l) 

implies a maxjgi log |mj| ^ Miloglogt(l + o(l)) for t large enough. 
So, for t large enough, log \m\ ^ M2 log log t for M2 larger than Mi/ a. 
Notice that M2 can be chosen independently of m in Mj since all the 
bounds are uniform in such m's. This proves the second assertion of 
Lemma 8.2.13, except for the restriction |Projy__L?;| ^ (logt)^^^ 

In the range obtained so far, (m, 1;)) has minimal value R{t) 

up to 0(1) as t tends to infinity. Hence, I{Bt) is as claimed. Finally, 
on this range, if |Projy^T;| ^ (logt)^^, then at least one component 

Vj, for some j not in T, is larger than {iogt)^'^ / ^fd. Consequently, as 
t tends to infinity, 

/($^o5„(pi,t(m,^;)) >i?(t) + alog7+ ]^<^^ o S^{vif + o{l) 

^ + a log 7 + aMs log log t (1 + 0(1)) , 

thanks to Lemma A. 1.5. So, provided M2 is large enough, the 
condition qx^i(m, f) belonging to ^ i(Bt)+M-^\o^,\o^t imposes to have 
|Projy_±f I less than (logt)''^^^ , 

As we did in the proof of Theorem 8.2.1, we can start to apply 
Theorem 5.1 in calculating the terms of the asymptotic formula. 
How do we choose the dominating manifold? 

Looking at the parameterization of dBt by qxf,{jn,v) in Lemma 
8.2.12, we see that variations of rijgil^jl °f order 1 yield small 
variations of order 1/logt on qx{m,v), i.e., in term of the Euclidean 
distance in W^, while, according to (8.2.9), they give variations of 
order 1 in term of /. This suggests that on the dominating manifold 
we should have Iliex l'^*! constant. Once m is fixed, (8.2.9) shows that 
fluctuations in space of order 1 in v yields fluctuations of order 1 on / 
as well. Keeping in mind that dBt is of order $"^0 ^^(v^) ~ V a log t 
as far as its size is concerned, we see that small fluctuations in v — on 
the scale of log t — brings sizable fluctuations of /. So, we should have 
V constant in the dominating manifold; and (8.2.9) suggests v = 0. 
This leads us to consider the set 

{meMx -Yllmil =7x} 
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of all points in Mj which minimize the product of their nonvanishing 
components. And then, one can try to choose the image of \/i times 
this set by <I>^o Sa as the dominating manifold, i.e., 

{ SaiVim) : m e Mx , JJ \mi\ = 7x } . 

This is not quite right since it does not belong to Aj(^^^^, but a 
projection would fix this detail. For some reason that the author 
does not quite understand — can someone give an explanation? — 
this does not work and breaks down when looking for a quadratic 
approximation of TBf The right manifold to consider seems to be a 
projection of 

= { ixim, 0) : m G Mx , max log \mi\ ^ (logt)^/^ , / G J(C) } 

on Ai(Bt)- The condition log|mj| ^ (logt)^/^ in the definition of 

T>'^^ guarantees that log|mj| = o(logt)^/^ and will allow us to use 
Lemma 8.2.12 and equality (8.2.9). Since A.j(^Bt) ^ ball of radius 



Pt = y2[l{Bt) — log(27r)'^/2), this leads us to introduce 

. ^ qj,t{'^,v) 

ri,t[m, V) = Pt-. -. ry , 

kx,t(m, v)\ 

and 

TlBt = I rx,t(m, 0) : m G Mj , maxlog \mi\ ^ (logt)^/^ , I G J{C) ] . 

Again, the projection on the sphere that we are doing is actually a 
mapping through the normal flow to the level set ^i^St)- more 
general situations, we would define rjAm,v) = i'Bt{QT,t{'nT',v), s) 
withs = /(5t)-/(gx,t(m,0)). 
Let us agree on the notation 

qx,t{'m) = qi,tim, 0) and rj^m) = rx,t{m, 0) . 

Define 

p{m) = \fa ^ sign(mi) log |mi |ei , m G Mx , T G J(C) . 

igX 

For any X in J(C) and any m in Mx with log|m| = o(logi)-'^/^, we 
have 

, , ^, , \ - . , , 'p{m) 
qxA^) = Q{t) ysign{mi)ei + — 
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as can be seen from Lemma 8.2.12. From Lemma 8.2.11, we infer that 
the vector "^i^j sign{mi)ei is constant on each connected component 
of Uje J(C) each of these connected components, we can think 

of the set of all qi,t{fn) — i.e., the connected components of — 
as a translation by a fixed vector of length ^/\T\Q{t) and a rescaling 
by 1/ \/Iogi of the corresponding connected component of the set 

Vj = {p{m):meMx}, I e J{C) . 

Following what we did in the proof of Theorem 8.2.1, we can start 
to apply Theorem 5.1. The value of I{Bt) is available from Lemma 
8.2.13. 

For m in Mj, the differential p^{m) is the restriction to T^Mx = 
{ (C + C^)m }^ n Vx of the matrix 

Vadiag(l/|mi|)jgj = ^/a^ei ^ ei/\mi\ . 

The change of variable m ^ Qi^tijn) gives 

AMv',^ = det(pJp,)^/'dMu,,,(c)M,(logi)(^-^(^»/^ , 

provided we integrate on the restricted range log|m| = o(logt)^/^. 
Since ri,t(m) is reasonably close to qx,t{m), we should be able to 
approximate the measure AM-Vg by AMxi' when applying Theorem 
5.1. 

From equation (8.2.9), we infer that 

TBt{qi,t{m)) = a^log|mi| - Q;log7 + o(l) as i ^ oo . 

To obtain the needed quadratic approximation for along the 
geodesic leaves orthogonal to TrVst, we first need to determine 
TrAj(^s^^QTr'DBf What should it be? Let us argue informally. Later, 
we will make the argument rigorous. First, we should be able to 
replace TrT>Bt by T^^ for some I in J{C) and m in Mj. Since 

"^Qi tim)^l(Bt) is tbe orthocomplement of gx^t(m)]R, we should expect 
Tq^,t{m)^l{Bt) © ^gi.tM^k to be roughly { qx,t{m) }^ T^^^^^^^V'^^. 
Given the expression for qx^tim), we have 

^gi.tM^Bt = Tp^mpx = p*{m)T„,Mx 

= { ^^ei yje(C + CT)mR| C Vx . 
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Intuitively, our idea of projecting on ^i[Bt) obtain T>Bt 

will work well if the projection does not create a singularity or 
reduce the dimension; in other words, if is transverse to the 
direction of qi,t{m) = Q{t)Y^-^jSiga.{mi)ei[l + o(l)), or, roughly, 
if J^^g2:Sign(mi)ei does not belong to Tpf^^-^Vx for all m in Mj. Since 
this latter vector is in Mj, our calculation of Tg^^^^^^V^^ allows to 
rewrite this condition as 

(^sign(TOi)|mi|ei,(C + C'^)m) ^ 0. 

It holds since ^^^j sign(mj)|mj|ej = m and {{C + C'^)'m,m) ^ 1. 
In this case, since gx^t(m)M is approximately X^jgj sign(mj)ejM C Vx, 
the subspace Vx is almost spanned by q'x,t('7i)M and Tp(^^-^Vx- So, 
approximately, 

Then, the expression of qx^t in Lemma 8.2.12 shows that in Vj" + 
<ZJ,^('7^)M, the boundary dBt is approximately the {d — N{C))- 
dimensional affine space + qx,t{^i^) — consider ^'^o Sa{vi) as 
a new coordinate, say, ^Vi. But then, working on + gi^f(m)M, 
the very same argument as in the proof of Theorem 8.2.1 shows that 
TBti^Wqj: t(m){''^)) approximately |ifp/2 for w in the normal bun- 
dle. So, we should have 



Gst {QT,t{m)) 



Id. 



-N(C) 



kx,t(m)| Q{t)y/N{C)' 
Thus, as t tends to infinity, 

detGB,iqx,t{m)r'^' - g(^)(d-JV(C7))/2^(^)(d-iV(C))/4 

~(aiV(C)logt)('^-^(^))/\ 

We can then put all the pieces together, find a way to drop the 
restriction log |mj| ^ (logt)^/'* in the range of integration on D^j, and 
use the formula given in Theorem 5.1 to obtain the — hypothetical 
— approximation 

P{{CX,X)^t}^ " 



/NataNiC)/2 

1 /2 

f det(]3*(m)^p*(m)) 

/ n dA^Uxe.(c)Mx("i) , 
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which is the result. 

Let us now work out the proper arguments. 
We first determine our candidate for CBf Let 

c(i) = (d+^^ + l) loglogt. 
Prom Proposition 2.1 and Lemma 8.2.13, we infer that 
L{l{Bt) + c{t)) = 0{l)e-^^'^*h-<*^I{Btf 

^^-aiV(C)/2(i^g^)d+(iV(C)/2)^lQg^)-<i-(iV(C)/2)-lQ(l) 
= o(i-"^(^V2) . (8.2.11) 

Thus, c{t) is a good candidate for an upper bound of — notice 
again that c{t) is of order log/(-Bt). Many assumptions in Theorem 
5.1 deal with the behavior of tb^ near the dominating manifold. Our 
goal is now to more or less calculate the value of r^^ where we need 
it. 

The following lemma will give us the normal bundle of "Dst i™" 
mersed in A/(b^). 

8.2.14. LEMMA. Let I be in J{C). Let m he a point m Mj with 
maxjgjlog|mi| = o(logt)^/2. Then, T^^^^^^yDBt + ri,t(m)M = Vx- 
Consequently, 

Proof. The map u G M'^\{0} i— > u/lu] G S^-i has differential 
|ii|~^Proj„x at u. Consequently, 

= P™j9i,t(m)-L{ 9X,i,*(?7l)?^ : V G TmMx } 

= P^oj.x,*MX { E u^^- ((^ + C'^)^ ,v)=0}cVx. 

Consequently, T^^ ^(^^^Vst is of dimension 

dim ({ (C + C^)m }^ n Vx) = dim Fx - 1 = N{C) - 1 
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and Tj.^ ^^^•^VBt+qi.t{'rn)R = Vx- Since ri,t(m) is collinear to q'x,t('7^), 
this gives the first statement of tlie lemma. 

The last statement of Lemma 8.2.14 follows from the fact that the 
tangent space of ^i(Bt) fx,t{'m) is orthogonal to qi^t{m) since the 
level sets are spheres. ■ 

As we did in the proof of Theorem 8.2.1 with Lemma 8.2.7, we now 
relate tb^ on D^* to TBt on ^'bi- Since V'^^ is somewhat more explicit 
than T>Bt^ and better parametrized, this will make further calculations 
easier. Using Lemma 8.2.7, we see that for any w orthogonal to 



rBt{^Wrj:,t{m){w)) = TBt 



pt 



m,t[m 



2 



w- 



Pt 



In particular, since expq(O) = q, 

TBt ( exp^x.tM (w)) - TBt {ri,t{m)) 



(8.2.12) 



-TBt 



--^Bt 



w- 



w 



|gx,tMI 
pt 

Pt 



- TBt{qi,t{^)) 

+ 0(1), 



the last equality coming from the parameterization of dBt in Lemma 
8.2.12. We can now prove the analogue of Lemma 8.2.8. 

8.2.15. LEMMA. For m in T>Bt and w in = T^^ ^(^)Aj(b^) e 
T 



in the range \w\ = o{^/log t), we have 

2 



W\ 



TBt ( exPri,t(m) (w)) - TBt {rx,t{^)) ^ 

Consequently, 



+ 0(1) 



as t ^ oo . 



GBt{ri,tM) 



Id, 



-N(C) 



Idinid- 



N(C) 



\Bl{rx,t{m))\ ^N{C)alogt 



as t ^ oo . 
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Proof. Let w he a vector orthogonal to Vj- From the calculation 
preceding Lemma 8.2.15, we see that it is enough to evaluate s = 
TBt{z) for z = expg^^^(^^){w\qi^t{m)\/pt). By definition, ij{z,s) is 
in the boundary of Bt. Lemma 8.2.12 shows that on directions 
orthogonal to Vi, the boundary dBt behaves like a — N{C))- 
dimensional linear subspace — at least in the range w = $^0 Sa{v) 
with V = o{Vi), that is \w\ = o{\ogt)^^'^. So, looking on the 
components on Vj, we must have for (m, v) in TV, 

{il;{z,s) ,ei) = {qx,t{m,v),ei) + ©(logt)"^/^ , i gX. 

To evaluate the left hand side term of the above equality, we use 

' 2^ 



'1 + 



cos (— )gx,t(m) + sin (— ) \qx,t{m)\^ 



To evaluate the right hand side, we have 

{qi,t{m, v) , Bi) = {qi,t{m, 0) , e^) + o(logt)"^/2 
Then, we obtain 

2.1, .n.„.^-l . 1^12 



\qi,t{m,s)\'^ / l + o{logt) ^ 
2 \ cos^ (}w\/pt) 



-1 =^ + «(l) 



in the range \w\ = o{pt), i.e., \w\ — o{\ogt)^/'^ as announced. 
The matrix GBt(Tx,t{''n)) in the statement of Lemma 8.2.15 is that 
corresponding to the assumption (5.17) as weakened at the end of 
Remark 5.2. ■ 

We are now in position to verify that the assumptions of Theorem 
5.1 hold. 

It will be helpful to keep in mind two orders of magnitudes. Since 
is contained in ^i[Bt) — Sd-i{0, pt), points in "Dst arc of order 
Pt, that is of order \/Iogt. On the other hand. Lemmas 8.2.12, 8.2.13 
and A. 1.5 show that any point of dBt n Ti(Bt)+c(t) is at a distance at 
most O(loglogi) of a point qj^ti'm)- Indeed, if |Projy^i;| ^ (logt)^^^ 
the component of v) on V-^ is Xli^i^d- j^i ^^°Sa{vi)e-i, which 

is of order (9(loglogt) thanks to Lemma A. 1.5. Since the projection 
onto the sphere ^i(Bt) is a Lipschitz function when acting on L^^^^^, 
the points in are also at a distance O(loglogi) of Vb^- 
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Assumption (5.1) holds trivially since k = dimMj = A'^ — 1. 

To check (5.2) amounts to proving that any point s on A/^^^), in a 
0(loglogt)-neighborhood of Vst, can be written in a unique way as 
expj,^^(^)(w) for some 1 in J(C), some m in Mj and 

^ ^ ^r-i,t(m)^7(Bt) © T^i,t{mpBt = Vj' ■ 

Lemma 8.2.11 implies that such a point s is actually in an O(loglogt)- 

neigliborliood of a unique set I^Sj H Vj = { rx^f(m) : m G Mj } for 
some X in J(C). For w; belonging to Tj.^^^(^^)Aj(^Bt) © ^r-i,t(m)^Bt> we 
have 

Proj^x exp^x,t(m)(^) = cos (^)rx,t(m) . 

Consequently, the component of s on Vj is in a one-to-one relation 
with cos (\w\/ pt)rx^t{'ni)- This last point being a positive multiple 
of rx,t{fn) £ 'S'd-i(0, pt), it identifies rx,t{fn) and consequently m. 
Looking at the component of s in V^, we can then calculate u; in a 
unique way, and (5.2) holds. 

Our choice of c{t) satisfying (8.2.11) will imply (5.4) ultimately, 
while (5.6) is plain. 

Assumption (5.5) can be verified exactly as in the proof of Theorem 
8.2.1. Indeed, Lemmas 8.2.12 and 8.2.13 show that dBt fl '^i{Bt)+c{t) 
can be approximated by a ruled hypersurface based on where 
the generators are (d — A) -dimensional Euclidean balls of radius 
O(loglogi). 

Assumption (5.7) is verified exactly in the same way as in the 
proof of Theorem 8.2.1. The bound io,JW"(p) = O (log log t) follows 
from Lemma 8.2.15 — or from the discussion at the beginning of this 
assumptions checklist, after the proof of Lemma 8.2.15. Assumptions 
(5.8) -(5.11) are obtained in the very same way as we did in the proof 
of Theorem 8.2.1. 

Checking (5.12) requires some more work. Let m be in Mj. 
Consider a curve m{s) on Mx, such that m(0) = m. The curve 
fx,t{jn{s)) lies on P^^. In Lemma 8.2.14 we proved that can 
be identified with T^^ ^^(^g-^ki(^Bt) © "^rx t{m[s))^ Bt for all small s. 
Hence, for any w in the unit sphere of V-^ and A positive, the curve 
cxpj.^. ^(m(s)) (Aw) on ki(Bt) is well defined. Its tangent vector field at 
s = is given by 



^exp^^_^(^(,))(Aw) 



:=0 
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d / (\\w\ 



rj,t{m{s)) +sm jPtri 

\ Pt y \w\j 



s=0 



= cosf^)rx,t,*(m)m'(0). (8.2.13) 



Pt 

Let p = exp^^^(-„(-g))(Au;). Since TTBtip) = i"i,t{m{s)) provided p 
stays in w^^^^^^^^^), and since the orthocomplement ofkenrBt,*{p) has 
dimension k, this orthocomplement can be identified as 

(ker7rBt,*(p))"^ = rx,t,*(m)r„Mj = T^^^(^^yDBt ■ 



Moreover, (8.2.13) implies that for u in T^^ ^(^^-^Vb^, 

'\\w\ 
P 



irBt,*{p)u = u^ cos ^^M^ . 



IdjRfc ^ cos (^^M j . 



In other words, the restriction of TrBt,*{p) to (ker 7rBj,*(p)) is the 
map 

P 

Consequently, J-KBtiv) = cos [X\w\/p)~'' = 1 + o(l) uniformly in the 
range A ^ O(loglogt) = o{pt) and m in T>Bf This proves (5.12). 

Before checking (5.13), we need to evaluate the candidate for the 
limiting integral, namely 

f e""^'I[o,c(t)](rBj 
K |D/|^(detGBjV2 "^^-^ 

— notice that we use the Riemannian measure on and not 
that on V^^. Since rx,t{''n) is in the image of qi^t{^) by the 
map u G M'^\{0} i-^^ ptu/\u\ whose differential at qx,t{'m') is 
(pt/kx,t(m)|)Projq^^(^)±, we have 

dMvs, (ri,t(™)) = dMx,^^ {qxA^^')) (l + o(l)) , (8.2.14) 

in the range |gx^t(m)| ~ p^. This range includes that for which 
rBj(rx,t(m)) is less than c{t), thanks to Lemmas 8.2.12 and 8.2.13. 
Furthermore, 

TBt {rj,t{'^)) = {qiAi^)) + \{\(i^,ti^)f - Pt) 

= I{qj,t{m)) - I{Bt) 

= log |TOi| — alog7 + o(l) as t — >■ oo 
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uniformly over TBt {'''t,l{^)) ^ c{t) — the first equality comes from 
(8.2.12) with 10 = 0; the second from the fact that qx,t{'m-) belongs 
to dBt, and so TBt{qi,t{m)) = 0; the third from (8.2.9) and Lemma 
8.2.13. 

Uniformly in tb^ (ri,t(m)) ^ c{t), we have 

|D/ (rx,t(m)) ~ |D/(gi,t(m))| ~ ^/Nak^t . 



Consequently, 



L 



|D/|(<^-*^+i)/2(detGBji 

/ 7"I[o,cW](tbJ 

niexl^'l''(^"logt)(<i-(^-i)+i)/4(A/-alogt)-(<i-^)/4 



^(Aralogt)V2 (iogt)(^-i)/2 



Thus, the formula in Theorem 5.1 becomes 

^ ' Jvs, |D/|(<^-^+i)/2(detGBjV2^-'^^«* 

(i^,,»a"/=^)^ 1 ^ r det(p,(m)V(m))'/' 

I[o,c(t)] (jBt (rt,x(m)))d>lMi(m) . 

This formula is valid whenever a is positive. To remove the term 
I[o,c(t)] ijBt {fi^tifn))^ from the formula, it is enough to prove that for 
all Tin J(C), 



/ 



1 /9 

, '^*^J) dMMAm)<oo, (8-2.15) 

because limt_>oo c{t) = oo. This is where we need the assumption a > 
N/2. For this purpose, we first derive a bound for det(p^,(?7i)^]5^(m)) . 
The following result will be useful. Since Mj is in the complement of 
a neighborhood of the origin, it extends Lemma 8.2.11. 
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8.2.16. LEMMA. There exists a constant K depending only on the 
matrix C such that for any I in J{C), any m in Mj and i in X, 

\m\ . . I 
— ^ \mi\ ^ K\m\ . 



Proof. Assume that the lower bound were false, and that, 
for instance, inf^gMi l^'il/l'Til = for some i in X and Z in 
J(C). Then, there exists a sequence m{n) in Mx such that 
lim„_^ oo I — 0. Given Lemma 8.2.11, this forces 

lim^^oo |w.(n)| = oo. Set m*(n) = m(n) — m(n)ej = Proj xm(n). 

i 

Define s[n) — |m*(n)| and v{n) = m*(n)/|m*(n)|. Since |m(n)| tends 
to infinity with n, the condition mi{n) /\m{n)\ = o(l), ensures that 
|m*(n)| does not vanish for n large enough. Thus v{n) is well defined 
for n large enough. Moreover, |mi(n)| = o(s(n)) as n tends to infinity. 

We then have 

1 = {Cm{n) , m{n)) 

)((C + C'^)ei,v{n)) + s{nf{Cv{n),v{n)) 

= o(s(n))^ + s{n)^{Cv{n) , v{n)) . 

Since v{n) is in the compact sphere ^tv-i C Vx\{i]^ we can assume, 
up to extracting a subsequence, that v = lim„^(x, v{n) exists. Then 
V belongs to and we must have {Cv,v) ~ s{n)~^ = o(l), i.e., 

{Cv,v) = 0. This contradicts assumption (8.2.8). 

The upper bound is trivial since \mi\ ^ |m| anjrway. ■ 

8.2.17. LEMMA. If I is in J{C) andm belongs to Mx, then 
^ det(p*(m)VM) < "^~^(^)^^^ • 



Proof. From Lemma 8.2.16, we infer 

V a > , n — r ^ ~i — r^^vi 
^-^ \mi\ \m\ 

in the sense that the difference between the right and left hand sides is 
a nonnegative matrix. Consequently, since restriction preserves matrix 
ordering, 
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Taking the determinant preserves the ordering as well, and the result 
follows. ■ 

We can now prove (8.2.15). 

8.2.18. LEMMA. If a > 2/N, then 

det(j>*(m)'^;?* (m))^^'^ 



J 



a 



dM.Mi{m) < oo . 



Proof. Using Lemmas 8.2.16 and 8.2.17, it is enough to prove that 



Mi |m|^-i+^" 



< oo . 



First, using a change of variable in polar coordinate, we see that for 
any e positive and s > N, 

\x\^e Us 

Take e < eo, where eo is as in Lemma 8.2.11. In Vi identified with 
M.^, make the change of variable x = 9m, with 9 positive and m in 
Ml. We claim that 

To obtain the right hand side and prove the claim, we argue as follows. 
Let m(ti, . . . , tjsi-i) be a parameterization of Mj. The Jacobian of the 
transformation x = 9m{ti, . . . ,tN-i) is 

, / dx dx dx\ ^Ar_i , /dm dm \ 

Using the multilinearity of the determinant, writing m as a component 
on spanj ^ , . . . , | = T^Mj and a component on (T^Mj)-^, 

we obtain 

/dm dm \ , /dm dm 
det — — , . . . , — , m 



-dti ' ' ' ' ' dtN- 



(om om \ 
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Up to a sign, this last determinant is 

iT^ /dm dm, \"^fdm dm, \\^''^ 

|Proj(T.M.)xm|^det(— , • ■ ■ , ^) J • 

This proves our claim. If s > iV, we obtain, after performing the 
integration in 9, 

oo> / dMMxM. 

JmeMx \'"'\ 

Recall that (T^Mi)-^ n Vj = Projy^(C + C^)mR. Since Mj is 
included in Vx, it follows that for any m belonging to Mj, 

. / N (mJC + C^)m) ,^ _rp, 

= |Projy,(C + CT)^|2 P^ojv.(C^ + C^^)- • 

Thus, 

1 



|Proj(T^Mi)i("^)l ^ 



|Projv^^(C + CT)|||m| " ||C 



Consequently, for s > N, the integral / , -.^ is finite. This 

proves Lemma 8.2.18. ■ 

Putting all the pieces together, we are left with only assumption 
(5.13) to check. But this is plain from the calculation we did in 
Lemmas 8.2.16 and 8.2.18. This concludes the proof of Theorem 
8.2.10. ■ 



We can now state an analogue of Theorem 8.2.9, that is a result on 

the limiting behavior of X given {CX, X) ^ t as t tends to infinity. 
For I in J{C), Lemma 8.2.11 implies that the vector (sign(mj))^^^ is 
constant on the connected components of Mj. Hence, to a connected 
component C of Mj, we can associate a unique unit vector 

ec = N~^/'^ ^ sign(mj)ei , 
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where m is any point in C. Moreover, for a > 2/N and G{m) as 
defined in Theorem 8.2.10, the number 



is finite. 



8.2.19. THEOREM. Under the assumptions of Theorem 8.2.10, 
the distribution of the random vector (sign(Xi) log j^^.^^/log\/i 
given {CX,X) ^ t converges weakly* to 



'' = J2f^cSec/Yl 



where the sums are over all connected components C o/Ux6J(C) 

Proof. We proceed in a similar way as we did for proving Theorem 
8.2.9. Set = Pt- The numerator of the measure in (5.18) when 
looking for the conditional distribution of Y/ pt given Y e Bt is 

pf-^exp(-TB,(ptr)) 

■dMvg./ptir) 



~ {Nalogtf/^ exp ( - rsMr)) dMv^Jptir) . 
Let / be a positive continuous function. Then (8.2.14) implies 

J f{r) exp ( - TBtiptr)) dM-p^^jp^ir) 

~ J fir) exp ( - TBtiptr)) dMv^jp^ir) 

as t tends to infinity. Using the parameterization of "Dst ^-^d the 
expression for TBt(rxtim)) obtained from equation (8.2.14), this last 
integral is also asymptotically equivalent to 



r 



XeJ{C) c 



det(p*(m)V*("i)) dMMi(m), (8.2.16) 



where the second summation is over all connected components C of 
Mj. But if m is in a connected component C, then 

rjt{m) 
hm = ee , 
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uniformly in the range of m's such that qi^t{m) belongs to 2?^^ — 
i.e., maxjgilog is less than (logi)^/^. Consequently, as t tends to 
infinity, (8.2.16) converges to 

E E /(ec)7")"c = l''Y.^'c [ /dJ^ . 
xe J(C) ceMx c 

Since the measure involved in (5.18) is normalized, we proved that the 
probability measure with density proportional to 



|D7(p,g)|^(detGB.(m))^ 

converges weakly* to v. Thus, (5.18) holds. 

Assumption (5.19) holds trivially. Assumption (5.20) follows from 
the fact that pt is of order y/logt, while for p in Vst, the set 
'^B^ip) ^ B_t,M is in an 0(loglogi)-neighborhood of Vb^ thanks to 
Lemma 8.2.13. 

Applying Corollary 5.3, we deduce that the conditional distribution 
of Y/ ^ Na log t given Y e Bt converges weakly* to u. 

Using the Skorokhod representation Theorem as we did in proving 
Theorem 8.2.9, Theorem 8.2.19 follows from 

5^o$(^ecV^alogt(l + o(l))) 

= VNecS;^o^(^y/alogt{l + o(l))) 

= ViVec exp log Vt(l + o(l))) 

as t tends to infinity, thanks to A. 1.6. ■ 

One way to interpret Theorem 8.2.19 is to say that X given 
{CX,X) ^ t is distributed as t(V2)+o(i)5' ^^ere S is distributed 
according to u. In other words, we obtained something like the first 
term of an expansion of X given {CX, X) ^ t. Given the work done, 
a little extra effort will give us a second order term. That is, given 
{CX,X) ^ and given that (sign(Xj) log |Xj|)^^^^^/ log \/t is close 
to some ec, we can derive the limiting distribution of X. Refining the 
asymptotic analysis, we could obtain an asymptotic expansion of the 
distribution of X given {CX, X) ^ t, as t tends to infinity, in term 
of successive conditional distributions. Such type of result is easier to 
phrase in term of random variables than in term of distributions. 
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8.2.20. THEOREM. Under the assumptions of Theorem 8.2.10, 
the random variable X given {CX, X) ^ t can he represented as 
\/tmgH{l + o(l)) + Tg + o{l), where the random variables g, nig, 
H and tg are as follows. 

The discrete random variable g has distribution v. 

Let C be a connected component of some Mj. The conditional 
density of rUg given g = ec is proportional to 

J,^'"^ "^^ , det G(m) dMcim) . 

The random variable H has a Pareto distribution 

P{H ^ A} = A-°^^ , A ^ 1 , 

and is independent of g and nig. Finally, Tg given g is a random 
vector in V^, with independent components, all having the original 
Student-like Sa distribution. 

One way to read Theorem 8.2.20 is in terms of simulating X from 
its conditional distribution given {CX, X) ^ t for large t. We pick a 
connected component C with probability proportional to iic- Once C 
is picked, it lies in a unique subspace Vj. In Vj, we simulate m with 
distribution proportional to 

J^ ^'^'^f^ I det G{m) dMc{m) . 

Next, we simulate H with the Pareto distribution. Then the part 
of X in Vi is \/iniH(^l + o(l)) as t tends to infinity. The part 
of X in is a random vector with independent and identically 
distributed components from the initial Student-like distribution, up 
to an additive term of order o(l) as t tends to infinity. We will 
explicitly calculate the norm of Projy^^^^]?(C)ec at the end of the proof 
of Theorem 8.2.20. 

Proof of Theorem 8.2.20. The intuition behind the proof is extremely 

simple given all that we did. Looking at y = <I>^o Sa{X), we want to 
obtain an approximation of Y given Y G Bt, and invert it to obtain 
one of X = 5^0 given {CX, X) ^ t. Lemma 8.2.12 asserts that 
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the points in Bt near Q{t)\/Nec are of the form il){qi^t{m.,v)^s). Set 
w = Sa{v), and define 



qi,t{in, w) = sign(mi) [Q(t) H ,J\^ — je-i + w, w e Vj . 

Formula (8.2.9), the definition of the normal flow, and Lemma 8.2.13 
show that the normal density at ipi^qj^ti^^ '^)' ^) 



exp 



'(V's(gx,t("i,w;))j =exp - l{qi^t{m,w)) 

R(t) c -|«,|2/2 



-=-^-^(1 + 0(1)) (8.2.17) 



In this expression, the term is an exponential dc^nsity, the term 
g-|w)| /2 jg ^ Gaussian one, and the term fliGjI'^il" '^i^l S^^'^ us 
another density. We can parameterize points y of Bt in term oi s,w,m. 
Then, we interpret these parameters as random variables. This will 
give a representation of the random variable Y given Y e Bt, and we 
will pull back this representation to X. 

However, we need to be careful with the different scales. For m in 
a connected component C of Mj, 

VlogtProj^± {qi,t{m, w) - Q{t)VNec) = p{m) , 

while 

Projy2.gx,t(m,w) = w. 

Thus, the part of y in should be centered and rescaled by VTogT, 
while that in V-^ is already of order 1. 

To proceed rigorously, let / be a nonnegative smooth function 
defined on x M'^"^, with compact support. 

For some 1 in J{C) and a connected component C of Mj, let us 
evaluate the integral 

^ /(^/b^Projy^(y-g(^)v^ec),Projv^x^/)e-^(^)d^/. (8.2.18) 

Divided hy P{Y e Bt}, this integral will give us the limiting condi- 
tional behavior of Y given Y G Bt, after proper normalization. 

If the pro j ection of y onto Vj is not in a neighborhood ofQ{t) \fN ec , 
then -y/logt Projy^ [y — Q{t)\/Nec) diverges as t tends to infinity. 
Since / is compactly supported, such points y do not contribute to 



132 



Chapter 8. Application to quadratic forms of random vectors 



the integral for large t. From the preceding, since / is bounded, 
we can also restrict the range of integration for those y's such that 
I{y) ^ I{Bt) + c{t). Let us make the change of variable 



y = i^{qi,t{^-,w),s) = Jl + Tz — ^ — -r:^qi,t{m,w) . 
Notice that 

|gj,t(m, if)!^ ^ ~ iVa logi 

as t tends to infinity, and uniformly in the range TBt{qx,t{^)) ^ '^(0- 
Therefore, 

s / 1 



y = Q.T,t{rn,w) + ^ ec + o( 



V Na log t V "v/log t 

as t tends to infinity, uniformly in the range /(y) ^ -^(-Bt) + c(t). 
Consequently, in that range of y's, 

\/logt Projy (y-Q{t)VNec) = p{m)-\ — ^=ec+o{l) as i ^ oo . 

Using (8.2.17), we can rewrite (8.2.18) as 

, , ko,i{Bt)+c{t)]{liy))f(pi^) + -^ec,w)x 
s'^oJmeMx Jwev^ ^ VJya ' 

-R{t) 

"I""! J{m,w, s)dMMx{'m) dw ds {I + o{l)) , 



-e e 



where J(m, w, s) is a Jacobian term. To calculate it, we first have 

dy qiA'm,w) ec 



ds |~ / /7~i 2s ViVcdogt 

|gj,t(m,w)| 



(1 + 0(1)) 



as t tends to infinity, uniformly in s in any compact set of M and m, w 
such that I{y) ^ I{Bt) + c{t). 

The explicit expression of i/jg (q) for the Gaussian distribution gives 
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Let m = m(iti, . . . , it^v-i) be a local parameterization of C. Then 

9Qi,t . 



1 din 
'-{m,w) = — p^{m)-—- . 



Consequently 
dy 



A/log i 

dqx^t{'m,w) _ 



duj ■^*.'^vi-^>''V"' -^y \/Togt 



1 Qui/ 

P*{m) — {l+o{l)) , 



uniformly in y such that /(y) ^ ^i^t) + c(i). Finally, since 
^qx,t{m,w) = ej, we have 

dy 



dwj 



= il^s,*{qi,t{'m,w))ej = +o(l) 



as t ^ oo . 



Define the {d x (i)-matrix with columns indexed by i and rows indexed 
by j, 



F = 



N-l 



d-N 



N-l 1 d-N 

\ dui duj I \dui' ds/i \ dui dwj / j,,- 

dy dy \ 
Vs' duj / j 

dy d\ 



dy I dy dy 



w 



duj I \ dwj ' ds/i \ dwi ' dwj / ij 



This ensures that J{m,w,s) = (detF)^/^. Since dy/dwi is orthog- 
onal to Vj, while dy/dui and dy/ds are in Vj, and since dy/dwj 
is roughly ej, the determinant of F is equal to o(logt)~^ plus the 
determinant of the upper left N x N block of F, that is 



det 



T^Y*^ ^mfi'^*"^ ^B^/ij ViValogi 



alogt^ C)P*( 



Analog* 



\ 



\ VATalog 

— in this determinant, i,j run over 1, . . . , A'' — 1. This determinant 
is equal to 



Ara(logt) 



AT 



dct 



-det|^^^)*(m) — ,p*(m) ^ 



iVa(logt) 



dui 



dui I 
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Consequently, up to o(logt) , 

|Proj(T p(c))iec| ^ x\V2,.. / , 

dy = \ — det (p* (m) (m) ) dMc (m) ds dw . 

Let us write 

p{C) — {p(m) : m £ C} , 

the image of a connected component C by ]?(•). If follows from our 
calculation that that the integral in (8.2.18) is 

det(pj(m)p*(m))^/^ dA^cM "^"^ ^/]v^(iog^)JV/2 + °^^^) " 
Elementary algebra shows that 



e 



\/iV^(logi)^/2 t^'^/^VNa {2T:)id-N)/2 ■ 
Consequently, as t tends to infinity, (8.2.18) is equivalent to 

,-\w\y2 



(2vr)('i-^)/2 



l^^°j(^p(m)P{C))-'^<^l det(]?*(m)'^p*(m))^^^dsdA^c(?7i) du; . 

Combining this estimate with that for P{ X E Bf} given by Theorem 
8.2.10, the conditional distribution of 



lO] 



gtProjv^(y - g(t)ViVec),Proj^x(y)) given Y e Bt 



and y in a neighborhood of ec converges weakly* to that of (p(M) + 
{S€c/-\/Na),W) where M, S and W are independent with respective 
densities proportional to 

1T3 ■ ,det(p*(m)VM)^''^ 
|P™JT,(^)(p(C))^ec| Yl.^^\mi\<^ d7Wp(c)(p) , 

e~^ds, and A/'(0, Idy±). 

In particular, S has an exponential density. 
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Using Skorokhod's representation theorem and up to changing 
the versions of the random variables, given y € -Bj and Y in the 
neighborhood of Q{t)\/Tjec, we have 

ProjviF = Q{t)VNec + (p(M) + ^^c) + o{logt)-^/^ 

Pioiy±Y = W + o{l) . 

Then, given X e At, we have 

X = S;^o $(Proj,.^y) + 5^ o f (M/ + o(l)) . 

The term Tj = o ^(W + o(l)) is asymptotically a random 
vector in with independent coefficients having a Student-like Sa 
distribution. Then, for i in X and t large enough, Lemma A. 1.6 yields 

5ro$((Proj^^y,e,)) 

+ o(log 

= sign(m,)5ro $(Q(i) + (p(M) , e,) + ^ + o{logt)-y') 

= Vi sign(mi) exp + ^'^^'\ p{M), a) + o(l)) . 

If both M and m belong to C, then 

a-^/hign{mi){p{M),ei) = log|(M,ei)| . 

Therefore, 

X = ViY, e^^^""^^ {M, ei)ei (l + o(l)) + + o(l) 

= Vte^/("^)M(l + o(l)) + Tj + o(l) 

To conclude the proof, notice first that e^/ has a Pareto distribu- 
tion, since 



p| exp (^~^) ^ X I = P{ u; ^ aA'' log a; } = 
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Finally, as announced, let us calculate the norm of the projection of 
ec onto the orthocomplement of the tangent space oi p{C) at p{m). 
Since 

. . ej (g) Ci 

mi 

ieT ' ' 

the tangent space of p{C) can be identified as 

_ \fni\ 

Defining ?;(to) = X^jgj |mi|((C + C'^)m, ei)ei, we have 

Tp^m)p{C) = VTn{v}^. 

Consequently, using that {Cm,m) = 1, we have 

{v{m),ec) 



\v{m)\ 



1 Eiex™((C + CT)m,e,) 



|Projr^^^^p(C)^ec| 



= (iV5^mf((C + CV,ei)^)'^'- 

This concludes the proof of Theorem 8.2.20. ■ 

Theorems 8.2.1 and 8.2.10 do not settle completely the tail behavior 
of {CX,X). For instance, it may happen that the largest diagonal 
term of C vanishes, or that N{C) > 1 but (8.2.8) does not hold. The 
situation not covered assumes that 

{Cu, u) = for some nonzero n in Vx and some T in , . 
T, of cardinality strictly less than A''(C). 

The technique developed in this section may work in this situation; 
but we will see in the next section that, under (8.2.19), some extra 

complication is added. For the time being, we will only prove a rather 
weak result. It will be useful for the statistical applications developed 
in chapter 9. 
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Denote Nq{C) the smallest cardinal of a set X such that (8.2.19) 
holds. Under (8.2.19), the next result asserts that the tail probability 
of {CX,X) is much lighter than f'^^o^'^)/^ . 

8.2.21. THEOREM. Let X be a d-dimensional random vector with 
independent and identically distributed components having a Student 
like distribution with parameter a. Let C be adx d matrix, and write 
No = No{C). Under (8.2.19), and if a > 2/Nq, 

lim i"^o/2p{ (CX, X) ^ i } = . 

Proof. The statement is obvious if C is negative. Thus, we assume 
that C is not negative. For e positive, denote = C + eld. The 
proof relies on two very basic observations. The first one is that for 
any positive e the matrix Ce — C is positive. Consequently, for any t, 

P{(CX,X) ^ n ^ P{(CeX,X) ^t}. 

Assumption (8.2.19) states that Nq < N{C). Let 

Jo{C) = {IeT : \I\ = No{C),3ueVj\{0}, {Cu,u) = 0}. 

The second observation is stated in the following lemma. 

8.2.22. LEMMA. If e is positive and small enough, then N{Ce) = 
Nq{C) and J{Cf) = Jq{C). Moreover, satisfies assumption 
(8.2.8). 

Proof. Denote by T/Vq the set of all subsets of { 1, . . . , d } of cardinal 
at most Nq. Assumption (8.2.19) is equivalent to the following. 

Whenever 2 is in Tat^ \ Jo(C), the compression of C to Vj is negative; 
moreover, for any X in Jq(C), this compression is nonpositivc and 
there exists a nonzero zj in Vj for which {Czj, zj) = 0. Consequently, 
(8.2.19) implies the existence of a positive eo such that {Cu, u) < — eo 
for any X in Tjsiq \ Jo{C) and any unit vector u of Vj. If e is positive, 
less than eg, and X is in Tjvq \ Jo(C), the compression of to Vj is 
negative. Furthermore, if X is in Jo(C), then {CeZj,zi) = e\zi\'^ is 
positive. This proves Ar(Ce) = A^o and J(Cg) = Jo{C) as well as 
satisfies (8.2.8). ■ 

Let e be positive and small enough so that the conclusions of Lemma 
8.2.22 hold. For X in Jo(C) = J(C,), denote 



M,,i ^{meVx: {C,m, m) = 1 } 
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Using our two observations and Theorem 8.2.10, we infer that for any 
e positive and small enough, 

limsupi"^o/2p{ {CX,X) ^ t} 

t—*oo 

Ks^ao^^ ^ f detG(m) ... , , 

To conclude the proof, we show that the above upper bound tends 
to as e tends to 0. For this purpose, we need a good description of 
Me^x- It will be helpful to have some understanding of the sets 

Zj{C) = {ueVj : {Cu,u) = 0}, leJoiC). 

Despite the fact that these sets are defined by a quadratic equation, 
minimality of A^o implies that they are linear spaces. 

8.2.23. LEMMA. Under (8.2.19), the setZj{C) is a one dimensional 
vector space. It coincides with the null eigensubspace in Vx of the 
compression of C + C'^ to Vi. 

Proof. Since the compression of C to is nonpositive, 

Zx{C) = {ueVi : {Cu, u) = sup {Cv, v) } . 

veVx 

The equality 2{Cu,u) = {{C + C'^)u,u), shows that Zi{C) is the 
eigensubspace associated to the largest eigenvalue of the compression 
of (C + C"^) to Vj. Thus, Zx{C) is indeed a linear space. 

Let i be in X G Jo{C) and it be a nonzero vector in Zj{C). If 
{u, Ci) vanishes, u belongs to the (iVo— l)-dimensional subspace 
contradicting the minimality of N{C). Consequently, no component 
otuinVx vanishes. 

Assume now that we can find two linearly independent vectors 
u and V in Zx{C'). Since none of their components in Vi vanish, 
there exists a linear combination of these two vector with at least one 
component in Vj vanishing. This linear combination is a vector which 
contradicts what we just showed. Consequently, there are not two 
linearly independent vectors in Zx{C). Since Zx{C) in nonempty by 
definition of A^O) this concludes the proof. ■ 

It follows from Lemma 8.2.23 that the unit sphere of the space 
Zx{C) is actually made of two points, z and —z. 
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To describe M^^j, we now introduce the sphere corresponding to 
the compression of —C to the orthocomplement of Zx{C) in Vj, that 
is 

Sj{C) = {veVx : {Cv,v) = -l-v ^Zt{C)} . 

Since Zx{C) is the eigenspace associated to the simple null eigenvalue 
of Projy^(C + C""^)!^^, the set Sj{C) is a compact ellipsoid. We can 
now make M^^j explicit. 

8.2.24. LEMMA. For any X in Jo{C), for any e positive and small 
enough, 

Me,x = { -^Vl + Kl-e)^ + yrsv: ve Sx{C) , 

O0,r?G{-l,l}}, 

where z is in the unit sphere of Zx{C). 

Proof. Let 1 be in Jo(C). A point m in Vx belongs to M^^x if and 
only if 

1 = (Cm,m) + e|mp. (8.2.20) 

The nonpositivity of C on Vx implies {Cm,m) ^ 0. Therefore 
|mp ^ 1/e. In particular, m is nonzero. Write A = Imp and 
p = m/\m\. Equality (8.2.20) and nonpositivity of C on Vx imply 

0^ {Cp,p) = -e+\^-€. (8.2.21) 
A 

The function u ^ {Cu. u) is continuous. Its restriction to the 
— compact — unit sphere of Vx is maximum exactly on Zx{C). 
Consequently, if U is an arbitrary neighborhood of the unit sphere 
of Zx{C) on the unit sphere of Vx, the inclusion 

— ■. me Me,x \ C U 
\m\ J 

holds for any positive e small enough. Then, wc can write p = 
cos6z + sindq for some q in Vx of norm 1 and orthogonal to z, and 
some 9 close to mod tt — the coefficients cos and sin 9 are imposed 
by \p\ = 1; speaking geometrically, we parameterize the sphere in 
normal coordinates. Since Projy^(C + C'^)z = thanks to Lemma 
8.2.23, equation (8.2.21) forces 

sin^ e {Cq, q) = -e + \. (8.2.22) 
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li 9 = mod vr, this forces A = 1/e and p = z. Thus m = z/y/e. 
Assume that 9^0 mod tt. Since Sj(C) is invariant under { —Id, Id }, 
equahty (8.2.22) imposes 



But equation (8.2.20) yields 

1 = -eA + 1 + e(A cos^ 6* + eA - 1) , 

from which we can obtain cos^ 9. Thus, 

m = r\\/\ + A(1 — e)2; + \/eA — If for ry G { — 1, 1 } . 

Set s = eA — 1 to obtain the parameterization given in the lemma. 
Notice that for s = —1, we obtain m = r\\f\z. ■ 

Wc can now obtain some good bounds on the components of points 
m belonging to M^^j. Wc write = (m, Cj) the i-th component of 
m, as we did in the proof of Theorem 8.2.10. The following statement 
is the analogue of Lemma 8.2.16. 

8.2.25. LEMMA. There exists a positive M such that for any positive 
€ small enough, any point m = -^^X + s(l — e) z + \fsv in M^x 

satisfies for all i inX the inequality 



Proof. For e small enough and any positive s, we have s/(2e) ^ 
s(l — e)/e ^ s/e. If r]{z,ei) is positive, this implies 




for some v G Sj{C) . 



We can then write 



m = 



VXp = VXcos9 z + VeA — 1 w . 





and 
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Since Sj{C) is compact and the functions s i— > s/(l + s) and s i— > 
(2 + s) /(I + s) are bounded on [0, oo), the result follows. 

If ri{z,ei) is negative, we proceed similarly. ■ 

We can conclude the proof of Theorem 8.2.21 with the next result. 

8.2.26. LEMMA. For any I m Jq{C), 

f det G(m) , . , , , 



Proof. Arguing as in Lemma 8.2.17, we obtain 

detGM ^ M/\m\^°-^ 

for any m in M^^j and some fixed positive M, not depending on e or 
m. Using Lemma 8.2.25, we have 



/ 



det G{m 



-dAlM,,i(m) 



Nq — 1 + Nqol 
2 



dA^MeiMO(l), (8.2.22) 



as e tends to 0. To express the Riemannian measure on Mg^x, notice 
that Sx{C) is a manifold of dimension A^o — 2. Write v{x\, . . . , xnq-2) 
for a local parameterization of Si{C). It induces a parameterization 

m{s,Xi, . . .,XNo-2) = ■^'\/Y+~s{l~~e) Z + ^/sv{Xl, . . .,XNo-2) 



of Mg X- In this local chart 



^l^^(iVo-2)/2 

:det 



dv 



eJl + s(l-e) 



g(iVo-2)/2 5^ 

H . ^ det 



/ dv dv \ 



(z,—,..., 
\ ' 5ari ' ' dxNo-2 

dv 



) 



ds A dxi A ... A dxiVo-2 • 



"""V dXl' " ' ' dXNo-2 

Since ^ is orthogonal to Sj{C) and of unit norm, we have 

dv dv 



det 



/ dv 
V ' dxi ' 



da;i A . . . A da;Aro-2 = dTW 53.(0 ■ 
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Therefore, 



s(JVo-3)/2 

+ ^ |Proj(T;^5^(c))±v| dsdMs^{c){v) ■ 



2 

Consequently, (8.2.21) is less than 

^(No-2+Noa)/2 g{No-2)/2 



I I 

Js>0 JS 



>oJs^iC) (1 + s)(^o-l+iVoa)/2 + 

g(Aro-l+Aro)/2 g{No-3)/2 
+ (i + ,)(iVo-l+iVoa)/2 2 1^1 dsdMs,iC){v) . 

We can bound the integral in v by the constant diamSx{C)Yo\Sx{C) , 
for instance. 

Then, the integral in s is less than 

Nq-2 Nq-I+Nqc 1 JVp-a AfQ-l+iVoa 

2 + 2 I s~ 2 -2 + s 2 ~ 2 ds 

"OO 

2 + 4 1 s~'- 2-ds < oo . 



Therefore, the right hand side of (8.2.21) is less than a constant times 

£2 + e 2 

This tends to as e tends to 0, provided a is strictly larger than 
(2/i¥o) — 1- This concludes the proof of Lemma 8.2.26 as well as that 
of Theorem 8.2.21. ■ 

8.3. Heavy tail and degeneracy. 

The results of section 8.2 arc incomplete. The tail behavior of {CX, X) 
is not described when the largest diagonal coefficient of C vanishes, 
or when (8.2.8) does not hold. The proof of Theorem 8.2.10 breaks 
down in these cases. 

In example 4 of chapter 1, we dealt with the tail of the product 
of two independent Cauchy random variables. In the language of the 
current chapter, we considered the 2 x 2-matrix 

^^1/0 1 



2 V 1 
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We saw that the tail of (CX, X) is in log t. For this specific matrix 
N{C) = 2. Comparing with the result obtained in the previous 
section, we have an extra logarithmic factor in t. Going back to 
chapter 1, the factor is explained by the fact that for most points 
{x, y) on the boundary of 

At = {{x,y) : xy^t} = VtAi , 

the minimum of x and y tends to infinity, sharing some similarity 
with the case where N{C) = 2 and (8.2.8) holds. The factor logt is 
explained by the closeness of the boundary dAf to the level sets of the 
Cauchy density, along a sizeable part of the hyperbola xy = t. 

Quite amazingly, Theorem 5.1 can still be used in this situation, 
but the analysis is a bit more involved than needed to prove Theorems 
8.2.1 or 8.2.10. Our goal in this section is rather modest. We do not 
intend to obtain the tail behavior of the quadratic form whenever 
(8.2.8) does not hold. We will concentrate only on matrices C with 
vanishing largest diagonal coefficient. The reason is twofold. First, 
this simple degeneracy is sufficient to understand what we should do 
when (8.2.8) does not hold. Second, we will make use of this specific 
case in chapter 11, when studying a statistical application. 

8.3.1. THEOREM. Let C be a d x d-real m,atrix, with vanishing 
largest diagonal coefficient. Let X be a random vector m W^, with 
independent components, having a Student-like distribution Sa- The 
following asymptotic equivalence holds as t tends to infinity, 

P{ {CX, X)^t}^ ^i^f,a«" E E + ^^-.^l" • 

Proof. Since {CX,X) = {{C + C^)X, X) /2, we will assume 
throughout the proof that C is symmetric. The proof builds upon 
that of Theorem 8.2.10 in the case N(C) = 2. However, Lemma 
8.2.11 cannot be used anymore since (8.2.8) does not hold. 
As before, we denote 

At = {peM.'^ : {Cp,p) ^t} = VtAi 

and 

Bt = ^-oSa{At) = {1>^oS^{p) -.peAt}. 
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This guarantees 

The main part of the proof is to precisely locate and describe the 
points in Bt with nearly minimal norm. This will be done through 
the next five lemmas. 

Since all the diagonal coefficients of C are nonpositivc, N{C) ^ 2. 
Our first lemma shows that N{C) = 2, an easy fact. It also introduces 
points which will be essential to determine a dominating manifold. 
Define 



MC) = {{i,j} : ^d, Ci,iCjj = and dj } . 

If { z, J } G J*(C) then either Cj^j or Cjj is null. 

8.3.2. LEMMA. Let I = be in J^,{C), and assume that 

Ci^i = 0. The solutions of the equation {Cm, m) = 1 inVx are 

[t^T. ~ '^j'^i'j) + ^0^3 ' m^- e M \ { } . 



Proof. Let m = mjej + mjej be a point in Vj- The equation 
1 = {Cm, m) can be rewritten as 



1 = 2Cijmimj + Cjjmj . 

Since {mi, 0) is not solution, we obtain the result in expressing mj as 
a function of mj. ■ 

It follows from Lemma 8.3.2 that J{C) contains J*(C). As we did 
in the proof of Theorem 8.2.10, define 

Mj = {meV2 : {Cm, m) = 1 } , X G J(C) , 

and 

7x = inf I \mi\ : m e Mx | . 

iei 

It will be convenient later to extend slightly the notation introduced 
so far. For X = { i, j } in J{C), we will write Mjj or Mj^i for Mj. 
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If Z belongs to J(C) \ J*{C), Lemma 8.2.11 shows that tj is posi- 
tive. If X = { i, j } is in J*{C), Lemma 8.3.2 gives 

because Cjj is nonpositive. Consequently, 

7 = min{ 7x : T G J(C) } 
is positive. As in Theorem 8.2.10, we hope 

will be minimum on Bt at points lying in $^0 (\/t|Jxe j(c) -^2:)- 
Let 

R{t) = 2a log Vi - log log - 2 log(X,,aa°/22v^) 

-|-log(27r)'^/^ -|-alog7. 

If I and m are fixed, respectively in J{C) and Mj, the proof of 
Theorem 8.2.10 — see the calculation of v)) before Lemma 

8.2.13 — shows that 

l{<^^o SaiVim)) = -alog7 + alog (Jjlm^l) +o(l) 

as f. tends to infinity. This STiggcsls that I{Bt) = +o(l) as t tends 
to infinity. To locate the minima of / on 5^, we try to imitate the 
second assertion of Lemma 8.2.13. The coming lemma gives a coarser 
estimate. After its proof, we will be able to explain more precisely the 
difficulty created by the nonemptyness of J*(C). 

8.3.3. LEMMA. Let M he a number strictly larger than 2, and e be 
positive. The set of all points p in dAi such that 

l{^^oSa{Vip)) ^ R{t) + M\og\ogt 

is included in an et-'^ I ^{logtf^/'^ -neighborhood o/Ujgj(c) ^i- 
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Proof. For any point p, we use Lemma A. 1.5 to obtain 

l{^^oSa{Vip)) 

= \ (^2al0g (iV3(logt)f )2 _ log log (iV3(logt)M/a) (1 + ^(1))^ ^ 

Consequently, for all the points under consideration in the lemma, and 
for t large enough, 

tijl^i^d: bi|^erV6(log^)^/"} 

^ alogi + (Af - l)loglogt + 0(1) 
^ f logi+ (M- i)loglogi(l + o(l)) 

^3-^(2M-4)i^^(l + o(l)) 
a logt ^ ^ 

< 3. 

Thus, at most two of the pj's have their absolute value larger than 
ei-V6(logt)A^". Since N{C) = 2, this concludes the proof. ■ 

Now, let us see precisely why the proof of Theorem 8.2.10 breaks 
down. Lemma 8.2.11 still holds for X in J(C) \ J*(C). But it fails 
if X = is in J^^iC), since Lemma 8.3.2 shows that we can have 

I mil as small as we like; and whenever Cj^j vanishes, \mj\ can also 
be as close to as desired. Lemma 8.2.11 was used in deriving the 
expression of qifi{m,v) in Lemma 8.2.12. Then all the proof was 
more or less based on this approximation of (i'j,t(m., f). Notice that 
we can still use this expression whenever we can make the expansion 
which was used in its proof. Thus, for T = in J(C), we still 

parameterize dAi near a point m of Mx, as 

pi{m, v) = m(l + h{v)) + v, ve TmdAi Q TmMj . 

We need to be able to prove that only those v's such that f <C m 
componentwise are of interest for us. Thanks to Lemma 8.3.3, this 
can be done right away on the range where 

\mi\ A \mj\ ^ r^/6(logi)^/" (8.3.1) 
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say. But it can be seen easily that we need a larger range when X 
belongs to J*(C). 

Another difference with the situation of Theorem 8.2.10 is that 
the sets Mx are no longer far apart. It is true that the distance 
between Uxej,(C) Uxej(c)\J,(C) strictly positive. Also 

the distance between Mj and Mj fori and J' distinct in J(C)\ J*(C) 
is positive. This follows from Lemma 8.2.11. However, if and 
{i,k} are in J* (C) with j ^ k then Mij and Mj^^ are at zero distance. 
They connect at infinity on the axis e^M . This can be seen as follows. 
For nik = rrijCij/Ci^k, the corresponding point on Mj^^ given by 
Lemma 8.3.2 is 



— ( rnkCk,k) ei + mkCk 

-\2Q.,m. 2Ch ') '^ Q,.k 



Its distance to 



2(^ ~ "^jCjj) ei + mjej G Mij 



IS 



CijCk^k\'^ _l_ ( 



1/2 



It tends to as rrij tends to 0. This has a dramatic consequence. 
For rrij small, we can have P{i^j}{m, v) — p{i^k}iiT^' , v') for some m in 
Mjj-, some m! in Mj^^ and v, v' satisfying the a priori estimate given 
in Lemma 8.3.2. In other words some components of v, even small, 
may cancel with the corresponding components of m. Thus, the naive 
approach used in Lemma 8.2.12 cannot STiccccd. 

It is essential to understand that what goes wrong here is the pa- 
rameterization of the set dAi . The naive parameterization with (m, v) 
is onto, but is not one-to-one anymore, at least in the interesting range. 
The trick is then to introduce a new parameterization, well tailored 
to handle small components of the points m in Ux6j»(C) Lemma 
8.3.2 shows that when (8.3.1) fails, then one component of m has to 
be of order larger than t^/^(logt)^/°. The corresponding component 
of V in pxim, v) can be neglected thanks to Lemma 8.3.3. Writing p 
for the corresponding point in dAi , we have pi ^ rrii. Since pi is very 
large and Lemma 8.3.3 tells us to look in a neighborhood of m, all the 
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other components of p must be small, going to as t tends to infinity. 
We can then single out another component of say Pk, such that we 
can locate p near Mj^^. To do so, write 

P = Pi^i + Pkdk + w 

where w is orthogonal to ^{i,fe}- Since p is in dAi and Ci^i vanishes, 

1 = 2Ci^kPiPk + PkCk,k + '2Pi{Cei, w) + 2pk{Cej, w) + {Cw, w) . 

Using Lemma 8.3.2, we focus on a zone where pk and w are o(l) as t 
tends to infinity, and obtain 

1 = 2Ci^kPiPk + 2pi{Cei, w) + o(l) . 

Notice that 

Ccj G span{ ei : {i,l} e J*{C) } . 

Thus, whenever {i,l} is in J<t{C), if we can guarantee \wi\ ^ \pk\, we 
obtain 

1 ^ {2\Ci,k\ + \\C\\Vd)\piPk\ + o{l). 

This gives us a lower bound on \pk\. It will be good enough to 
bound \/f,pk away from 0, and to do an asymptotic expansion of 
Sa{\/tpk)f' in the calculation of l[^^o Sa{y/tp)). We will 
then be able to improve the a priori bound of Lemma 8.3.3. Only 
then we will use the parameterization px{m, v). 

Wc have not said how to guarantee that \wi\ ^ \pk\- This is easy. 
Take pfc to be the largest of the |pj|'s for { j } in J*(C). This index 
k depends on p G dAi , and this parameterization is one-to-one up to 
a set of Lebesgue measure zero. 

To proceed along these lines, define the function 



Q(x) = \ 2a\ogy/x T ° ■■ — /'''" — , x>0. 

^ 2V2alogVi ^/2a\og^ 

8.3.4. LEMMA. Letl = he in J{C). Uniformly in the range 

m in Mj and v in TmdAi \ T^Mj such that \/i{\mi\ A \mj\) tends to 
infinity, Vi = o{mi) and Vj = o{mj), 

^^oSa(y/tpi{m,v)) = ^sigii{mk){Q{tml)+o[\og{tml))~^''^^ek 

kex 

+ $"-o Sa{'Proiy±v) . 
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Proof. In the given range, for k = 

{VipT{m,v),ek) = Vimk{l + o{l)) 

tends to infinity. We can apply Lemma A. 1.5 to obtain an asymptotic 
expansion of $*~o S'„((v^pj(m, t;), e;;.))- This gives the result. ■ 

In particular, for the points considered in Lemma 8.3.4, as t tends 
to infinity, 

l(^'-oSa{VipT{m,v))'^ 

= ^ (^alog(Vt|mfc|) - ^loglog(v^lmfcl)^ 

feex (8.3.2) 
- 21og(X,,aa°/220F) + log(27r)<^/2 

+ ^|$^oS'a(Proj^xt>)|2 + o(l). 

As we explained above, Lemmas 8.3.3 and 8.3.4 give us a suitable 
approximation of dB^ near points parametrized by m = mjCj + mjCj 
in Mi J with \mi\ A \mj\ ^ t~^/^(logt)^/". Those points have two 
components greater than i~^/^(logt)^/°(l + o(l)). If now a point 
p in dAi has only one component larger than t~^/^(logt)^/" and is 
in a o(l)-neighborhood of Ujgj(c) then one component, say pi, 
has to blow up with t. Then, p has to be in a o(l)-neighborhood of 
Ufe-Ci fc^o ^i,k- The following lemma gives some control on the largest 
component of Pk for Ci^k nonzero. 

8.3.5. LEMMA. Let e be a positive number. Leti be such thatCi^i is 
null. Furthermore, letp be a point in dAi and in a o{l) -neighborhood 
Ufc C- fc^o -^iifc' as t tends to infinity, with \pi\ ^ e. Choose j such 
that 

\pj\ = max{ \pk\ : Q^fc 7^ 0} . 
Then, for t large enough, 

1 



|C|| + e 

In particular, for t large enough, the inequality \pi\ ^ r forces 



8r(d||C|| +e 
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Proof. Since IJ^.q ^.^q included in the union of planes 

generated by {ei,ek) for Ci^k nonzero, at most two components of 
p are not o(l). Moreover, one component is pi and the other one is 
a Pk for Ci^k nonzero, that is pj. Write p = piCi + pjCj + w with w 
orthogonal to both and Cj . By definition of pj , each component of 
w is smaller than \pj\. From what precedes, the norm of w has to be 
less than Vd\pj\, as well as o(l). Since p belongs to dAi and Ccj is 
in span{efc : Ci^k 7^ 0}, 

1 = {Cp,p) = 2piPjCij + 2pi{Cei, w) + 2pj{Cej,w) + {Cw, w) 
^ 2\piPj\\\C\\ + 2Vd\piPj\\\C\\ + \pj\o{l) + o{l) 

as t tends to infinity. Since \pi\ ^ e, we obtain 

1 ^ \p^Pj\{2{l + v^)||C|| + 4e-i) + 0(1) . 

This implies the first statement of the lemma. The second follows 
trivially. ■ 

The lower bounds in Lemma 8.3.5 are useful only if pi and pj are 
not too large, so that we can have \/t(|pj| A \pj\) going to infinity. 
Thus we need to shrink the domain on which we need to perform the 
integration. This relies on the simple observation that for Student-like 
distributions, a Bonferoni type inequality gives 

P\ max \Xi\ ^ — I < 2di^X, (8.3.3) 
I Ki^d' ' (logt)V" J ^ t«M« ^ ' 

for t large enough. Thus, we define the subset 

A', = ViA[ = Vi{peR' : (Cp,p)^l, max |p,|^^^^^|i^}. 

L l^t^d (logi)^/" J 

We denote B'^ = ^^(v^^). 

We can now improve dramatically upon Lemma 8.3.3. 

8.3.6. LEMMA. The set of dip's in dA[ such that 

l{^^o SaiVtp)) ^ R{t) + Mloglogt 

can be parameterized as p{m, v) with m in some Mj, some I in J{C) 
and 

Vvo]y^^V = 0(t-l/2(logt)^/(2«)(loglogt)V(2")) . 
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Moreover, whenever |mj| A \mj\ S> t ^/^(logt)''^/^^"^(loglogt)^/^^°), 
then Vi = o{mi) and Vj = o{mj). In addition, 

/($^o SaiyftA'^)) ^ R{t) - alogT - a log(16d ||C||) + o(l) 

as t tends to infinity. 

Proof. Let p be a point in dA[ as in the statement of the lemma. 
Lemma 8.3.3 guarantees that it belongs to an o(t^^/^(log)^/")- 
neighborhood of Uiej(c) o(i~^/^(logt)*^/")-neigh- 
borhood of UjeJ(c)\J*(c) ^^2:, then we are done thanks to Lemmas 
8.3.4 and 8.2. IL Thus, assume that it is not. Let i be such that 

\pi\ = max{ \pk\ : 1 ^ k ^ d} . 

Since 1 = {Cp,p) is less than ||C|||pp, we have 

\p,\^l/d\\C\\^t-y^{logtf'/^. 

Therefore, p is in a o(t~^/^(log f)^/")-neighborhood of Ufe-c k¥=o ^i,k- 
Since p belongs to A[ , 

log log t 
^ (logt)Va • 

Let pj be as in Lemma 8.3.5. The second inequality in Lemma 8.3.5 
gives 

(logt)V" 1 
" VtloglogU6d||C|| ■ 

In particular, both and v^|pj| tend to infinity. Write p = 

Pi^i + Pj^j + w with w orthogonal to both Cj and ej. We calculate 

+ ^|$^o5«(Vtw;)P + log(27r)<^/2 
= alog(v^|;?j|) - ^loglog(\/t|pi|) +Q;log(\/t|pj|) 



- ^ loglog(Vtb, |) - 21og(i^,,aa"/220F) 
+ ^|$^o + log(27r)'^/2 ^ ^(1^ 



152 



Chapter 8. Application to quadratic forms of random vectors 



as t tends to infinity. From the first estimate in Lemma 8.3.5, we 
deduce 

a log(v^|pi I) + a log{Vi\pj\) ^ 2a log - a log(16 d\\C\\) . 
Moreover, since log(l + h) ^ h for any h > —1, we also have 

\oglog{Vi\pi\) = log(logV^ + log|pi|) ^ \og\ogVi+- — ^ . 

log Vt 

Thus, 

/($^o SaiVip)) ^ 2a log Vt - log log Vi - 2 log{K,^^a''/^2^) 

+ log{2TTf/^ - a\og{l6d\\C\\) + o{l) 

+ ^\^^oSa{Viw)\\ 

where the o(l)-term does not depend on p in A[. This gives the lower 
bound on /(^'"o Sa{\/iA[)) stated in the Lemma. Moreover, the 
inequality 

l{^^oS^{Vip)) ^ R{t) + M loglogt 

implies 

\^^o Sa{\/iw)\^ ^ Mloglogt + c 
for some constant c. This forces, for any k not in { i,j }, 

Vi\Wk\ ^ S^O(f){^Mloglogt + c) = (l0gi)^/(2a)(iQgiQg^)l/(2a)Q(i) 

as t tends to infinity, thanks to Lemma A. 1.6. 

Set I = We just showed that the projection oip on is at 

most of order t~-'^/^{\ogt)^/^'^"\\oglogt)^/'-^°'^ as t tends to infinity. 
From Lemma 8.3.3, we then know that p = p{m, v) for some m in 
Mx and v in T^dAi \ TmMx with |Projvv^^| = o(t-V6(iogt)A^/a). 
Therefore, if \mi\ and \mj\ are at least t~^'^{logt)^/°' , the lemma is 
proved. Our choice of i implies that the only case left to investigate 
is {nij I ^ t-^/6(logt)^/". In this case, Lemma 8.3.2 shows that C'i i 
must vanish — because nii is lower bounded if Cjj is not null in 
Lemma 8.3.2; this is what we are claiming up to a permutation of i 
and j. Then, Lemma 8.3.2 shows that rui ~ l/{2Cijmj) as t tends to 
infinity. Since v is in T^dAi Q Ty^Mj, it is orthogonal to the vector 
spanning T^Mj, that is, to 
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Thus, there exists a real r such that 

Projv.^^; = re, + ^ (-^ + C,- e,- . (8.3.4) 

Since v belongs to TmdAi, we must have (Cm, v) = 0, or, equivalently, 
(Projy^Cm, Projy^i;) = (CTO,Projy^i;) = -(Cm, Projyxi;) . (8.3.5) 
Since Cj^j is null, 

(Projy^ Cm, Projy^v) = - rnjCjj^ ej + rrijCijei + mjCjjej , 



2C,: 1 Vm^ -^'-^ ' ■' 



r 

r-jCii + 



Thus, if nij tends to as i tends to infinity, 

(Projy^ Cm, Projv.^^;) ~ ' 3 • 

On the other hand, using the bound on the projection of p on that 
we obtained earlier in the proof of this lemma, 

KCm, Projyx^;)! ^ l|C|l 1"^! |Projyx(t;)| 

= ^0(rV2(logf)^/(2«)(loglogi)V(2a)) . 

\mj\ 

Therefore, (8.3.5) yields 

r = m20(rV2(iogi)A^/(2«)(ioglogt)V(2a)) . 

as t tends to infinity. Therefore, going back to (8.3.4), 

Projy^^; = o(mOei + (O (t-i/2 (log t)^/(2a) (iQg ^) i/(2a) ) j g . _ 

This concludes the proof of Lemma 8.3.6. ■ 

Though we do not need this right now. Lemmas 8.3.3 and 8.3.6 
imply 

I{Bl) ~ R{t) a log t as t — > (X) . 
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More importantly, if i belongs to X = {hj} € J*{C), and m is 
in Mj, Lemma 8.3.2 shows that rrij ~ l/2Cijmi as tends to 
infinity. Therefore, Lemma 8.3.6 allows us to use the expansion of 
Lemma 8.3.4 in the range \mi\ < v^/(logt)^/(2a)Qogiog^)i/(2a)_ 

To conclude what we have done so far, we can use Lemma 8.3.4 

over all points q = $^ o Sa{Vip) in Bf, with p belonging to A[, 
except for those in an 0(t^^/^(logt)^/")-neighborhood of a point 
m = mjCj + TTijej with {i,j} in J^{C) and, say, 

Vi/{logt)^/^'^"\log\ogt)^/" ^ \mi\ V \mj\ ^ MVi/(logt)^/" . 

We now discard this missing range by an ad hoc argument. The 
key observation is that Lemma 8.3.5 guarantees that this set of p's is 
included in 

Vt]og]ogt I ^ 1 1 

^ (logt)V" ' i^^^^i ^ i6d\\c\\r 



We now show that ultimately we will be able to discard the set Qt 
in our computation. 

8.3.7. LEMMA. We have 

P{X e Vint} = o(^-^^ ast^oo. 



Proof. The upper and lower tail of the Student-like distributions 
are asymptotically equivalent. The result is then a consequence of the 
independence of the Xj's and the following calculation, 

p r t < < t^oS^ogt ■ XX > ^ \ 

I (logt)A^/(2")(loglogi)Va ^ ^ (logt)i/« ' ^'^^ ^ 16d||C|| J 

/•tloglogt(logt)-i/« 

= / 1- Sa{t/16d\\C\\x)dSa{x) 

Jt(log t)-*f/(2«) (log log 

/•Uoglogt(logi)-i/- a 
= 0(1) / —dSaix). 

it(logt)-*f/(2«)(loglogt)-l/a 
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An integration by parts shows that the last integral is 

fal-^ W"V-^^ -^^.'Jt(logt)-A^/(2")(logIogt)-V« 

/-tloglogtOogt)-!/" 
+ — / X'^-^I - Saix)) dx . 

Jt(logt)-Af/(2a)(loglogt)-V« 

Since x°'{l — Sa{x)) tends to a constant as x tends to infinity, the 
above sum is 



/■t log logt(log 

o(r") + 0(l)t-° / x-^dx 



ft log logt(log 

t (log t ) - ( 2a) (log log t ) - 1 / « 

o{t~") + 0(i)r"iogiogt = o(r"iogi) 



as t tends to infinity. ■ 

Define 

Combining (8.3.3) and Lemma 8.3.7, we obtain 

P{{CX,X)^t} = l,,e-^^'^dy + o(^-^). 

From now on, we concentrate on the integral of on B'^'. Combin- 
ing Lemmas 8.3.4, 8.3.5 and 8.3.6, the set B" f] TR(t)+M logiogt can be 
identified, for t large enough, with some points ^^^o Sa{\/ipi{m, v)) , 
with 

I G J(C7), m G Mj, ve TmdAi \ TmMj 

|Projyx| ^ r (log t)2A^/° (log log i) 1/(2")^ 

minfcej|mfc| ^ t-V2(loglogt)i/"(logt)A^/(2")^ 
and Projy^f = o{m) componentwise. 

For those points, we can use Lemma 8.3.4 and (8.3.2). This allows 
us to obtain the value of I{B") up to an o(l)-term as t tends to infinity. 

8.3.8. LEMMA. The equality I{B'I) = R{t) + o(l) holds as t tends 
to infinity. 
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Proof. Making use of formula (8.3.2) we first need to evaluate the 
minimum of the function 

(alog{Vi\mk\) - ^loglog(Vi|mfe|)) 

feex 

on Mx- We also need an approximate location of the minimum, to 
check that ^^(v^ •) maps it to a point in B'/ — and not in Bf. 
When Z is in J(C) \ J*{C), we can argue as in (8.2.9). We concentrate 
on the case where X = { i, j } belongs to J*{C). Considering Lemma 
8.3.2, this leads us to define the function 

ft,Cj,Mj) = "log (21^ ^ ~ "^^^^'^ ) + "log(Vi|mj|) 



^2|Cij| 



When Cjj- vanishes, this function is 



ftfi{rnj) = a\ogt - alog(2|Cij|) - ;^ log log ^ 



2 \Ci,jmj\ 

- ■^^og\og{\rt\mj\) . 

The function ftfli') is minimum when r = log \mj\ maximizes 
(log Vi - log \Cij I - r) (log Vi + r), 

that is 



1 
2 

Moreover, as t tends to infinity, 



/t,o(-^) = alogt - alog(2|C,j|) - loglog + o(l) 



rrii 



When ruj = Xj ^\Ci^j\ and C^j vanishes, Lemma 8.3.2 gives 
sign(Cij)/(2^|Cij|). Clearly mjCi + mjCj is mapped into B'l by 

*^o5a(v^-)- 

If Cj j is nonzero and rrij is positive and fixed, then 

ft,Cjj{mj) = alogt - alog(2|Cij|) + alog(l - rrijCjj) 

-loglogVt + o(l) (8.3.6) 
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— recall that Cjj is negative. Thus, for ft,Cj ji'^'j) to be minimum, 
we must have tends to as t tends to infinity. But if Vtpi{m, v) 
belongs to B'/, expansion (8.3.6) always holds since \mi\ A \mj\ ^ 
l/Vi. Taking \mj\ = 1/logi for instance, we have 

ft,Cj,j (1/ log t)=alogt-a log(2|Cij I) - log log Vi + o(l) . 

as t tends to infinity. Given the uniformity over B" in (8.3.2), this 
gives the asymptotic minimum of ft,Cjj, and ultimately I{B'/). ■ 

We can now apply Theorem 5.1. Denote by 

Pt = ^2I{B'/)-log{2nr/^ 

the radius of the ball ^i(Bt) ■ -^-s we have seen after the proof of Lemma 
8.3.6, 

pt ~ \/ 2a log t as t ^ oo . 
In view of Lemma 8.3.4, for I in J(C) and m belonging to Mj, define 

9x,t(m) = ^sign{mk)Q{tml)ek 
and its projection onto ^i(Bt) through the normal flow, 

ri,t{m)=pt- -—. 

We consider the dominating manifold of dimension A; = 1, 

'^B'/= U {rx,t(m):mG [j Mx^m^Qt] 
ieJ{C) xeJ(C) 

Vtloglogt , / \\ / 1 1 A 
KflJ"^^' ^ (logt)Vc^ ^^B'A^^M)) ^ l°gl°g*| • 

It is fairly clear that V^n is a dominating manifold for the set 
B" {^V j(^B'^^+M\ogiogt- We can take M large enough such that 



i?.{t) + M log log t 



When applying the formula given in Theorem 5.1, the integral over 
Vqii splits into two parts. The first part comes from the contribution 
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of points r-x^t{m) with Z belonging to J{C) \ J^{C). This part is 
exactly like the integral we dealt with in the proof of Theorem 8.2.10. 
It is of order Thus, we concentrate on the second part, coming 

from points rx,t{m) with X in J*(C). 

Let X = be in J*(C) and rx,t(m) be a point of I'b^'- From 

(8.3.2) we infer 

Ts^'(rj,t(m)) = ]^{\qx,t\^ - pf) 

1 log(Vt|m^|) 
= alog \mimj\ - alog7 - - log — ; ^= — 

2 log \/t 

Consequently, 

exp - rB//(rx,t(m))j 



\mimj\'^^^ \ log\/i/V logVi^ 

uniformly over the part of V^'^ corresponding to points rx,t{Tn) with 
m in Mx- Notice that if (Tj j is nonzero, the inequality 

tb'^ {rx,t{m)) ^ M log log t 

required in V^n imposes \mj \ ^ (log i)2M/a j^j. ^ ^g^j-gg enough; this can 
be seen from the above expression of rg/' (ri,t(m)) and using the same 
lower bound argument as in the proof of Lemma 8.3.6 to handle the 
term in log ( log(\/t|mj|)/ log \/t) . Therefore, when applying Theorem 
5.1 to a component coming from M^j with Cjj nonzero, we need only 
to integrate for \mj\ ^ (logt)^^/". 

Still in order to apply the formula in Theorem 5.1, on T>^', 

\l)l[rx,t{m)) I = \rx^t{m)\ ^ pt ^/2alogt as t ^ 00 . 

We can now calculate the matrix Gs^' (rx,t(m)) . In the range of 
m's that we are considering in T^^i, Lemma 8.3.4 shows that dB'/ 
behaves like a ruled surface made of (d— 2)-dimensional flat subspaces 
in directions orthogonal to Vx- Hence, the very same argument as in 
Lemma 8.2.8 shows that 

Gdii (rxti'm)) ~ ^.'^^''^ ^ ■ as i ^ 00 , 
^* ^ V2alogt 
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uniformly over the part of Mj we are considering. We can then write 
explicitly the contribution of the part related to Mx in the formula 
given in Theorem 5.1. This contribution is 

where we integrate over all points rx^t{iT^)-, for m in Mj, with |mj| V 
\mj\ ^ log log t/ (log 0^/"; and \mj\ ^ (logt)^*^/" if Cjj is 
nonzero. As in Theorem 8.2.10, we can replace the Riemannian 
measure on rx^t(Mi) by that on qx,t{^x)- Its expression in the local 
parameterization given by ruj in Lemma 8.3.2 is 

2\ 1/2 



Since Q'{x) ~ ^/a/{2x\/\ogx) as x tends to infinity, this expression is 
equivalent to 

a / drrii \ 2 a \ ^^"^ ^ 

^ m? log {tml ) V drrij ) m| log (im^ ) / ^ ' 

Putting all the pieces together, the contribution to the integral in 
Theorem 5.1 coming from Mx is 



(27r)^'^-2)/2^ 



/ 



^ ^ log|m^| \ _^ fog ^ 



7" V V fogVt A logVt / ^ 
I rmmj I " (2a log t)^/^ (2a log t ) -(<i-2)/4 

a / dm,- \ 2 a ^ "^^^ 



+ ^2 1„^..^2^ dm,- , (8.3.7) 



^m^ log(tm^) Vdmj / mj log(tmj 
where we integrate over m, such that 

if Cjj is nonzero, and 



(logt)W(2«)(loglogt)V« 



160 



Chapter 8. Application to quadratic forms of random vectors 



if Cjj is null. The finale of the proof consists in showing how simple 
this integral is, at least asymptotically! Up to multiplying it by 2, we 
can restrict the range of integration to mj positive. 
We first integrate in the range 

(log (log log V" (1 + o(l)) 1 



2\C, 



logt 



In this range, rrii ~ l/{2Cijmj), and 

1 /dmi\2 1 



Vdmo 



m 



2 • 



Using the change of variable rrij = t^, the integral becomes — on that 
range — 

^logv^ ^s,a""2 



V2alogi 



where we integrate for 

1 M log log t 
~2 2a logt 

This gives a term 



J |2Qjr(l+o(l))^(l-2s + o(l))(l + 2s)x 

1 / 1 1 \l/2 

Va ; iz r 77T + z ^ \ogtfds 

t^Vlogt VI -2s + o(l) l + 2s/ ^ 

J V2 + o(l)ds, 



;i + o(i)) 



log log t 
logt 



When 1/logt ^ rrij ^ (logt)^^/"^, then log|mj| and log|mj| are 
O(loglogt). This part of the integral contributes a term less than 



log 
t 



2 "|Q,,|" (loglogt)V^ 1 
mpjj)- (logt)V2 



1 /dm,\2 1 \i/2 



V dm,- / 



+ 



drrij . 



Since Cjj is nonpositive, 1 — rrVjCjj ^ 1. Therefore 



1 drrij 




1 l + m^C,-, 


mj dnij 




m^- 1 - mjCjj 



m-j 
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The contribution in the integral is of order at most 

^/loglogix /-(log dm, ^ ^/ (loglogt)^ \ ^ ^/logU 

Consequently, if Cjj is not zero, (8.3.7) is equivalent to 

as t tends to infinity. 

If Cjj vanishes, we need to add a contribution for the part where 

(logt)2^/"^m,^ ^ 



(logt)A^/(2a)(loglogt)V" ■ 

We argue as in the case nij ^ 1/ log t above — this amounts to exploit 
the symmetry between rrii and rrij when mj is large. Therefore, if Cjj 
is null, (8.3.7) is equivalent to 

as t tends to infinity. Putting all the estimates together, we obtain 

B't' t 

5Z |2<^ij r (Ir\{0} (C'i.i) + 2I{o} (C'i.i)) (8.3.8) 

(i,i)GJ.(C) 

Noticing that 

X] |2Cjj|''(l]R\{o}(Cj,j) + 2I{o}(Cj,j)) = |2Cjj|°, 

{i,j}eJ*(C) i:Ci,i=01<j^d 

we see that (8.3.8) is the expression given in the statement of Theorem 
8.3.1 since we replaced C by (C + C'^)/2 in this proof. 

To conclude the proof, wc need to check the assumptions of Theo- 
rem 5.1. This part of the proof of Theorem 8.2.10 can be copied almost 
word for word, and this concludes the proof of Theorem 8.3.1. ■ 

Combining Theorem 8.2.21 and Lemmas 8.3.2, we obtain the 
following result. 
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8.3.9. COROLLARY. Let X he a d- dimensional random vector 
with independent and identically distributed components, all having 
a Student-like distribution with parameter a. Let C be a dx d matrix, 
with N{C)>2. Then 

lim t"P{{CX,X) ^t} = 0. 

t— »oo 

Proof. Notice that if the largest diagonal element of C is positive, 
then N{C) = 1. Moreover, if this largest diagonal term vanishes, 
Lemma 8.3.2 gives N{C) = 2. Then, N{C) > 2 imphes that all 
the diagonal coefficients of C are negative. Consequently, Nq{C) 
— defined after (8.2.19) — is at least 2. Apply Theorem 8.2.21 to 
conclude. ■ 

At this point, the reader who doubts of the usefulness of Theorem 
5.1 in providing a systematic technique should try to obtain the result 
of this chapter by other methods. Maybe once the results are known. 
Theorems 8.2.10, 8.2.21 and 8.3.1 can be proved more simply. It is also 
hoped that though Theorem 8.2.1 can be derived by easier methods, 
the path taken makes the current proof rather didactic. As it may 
have been noticed. Theorems 8.2.10 and 8.3.1 add extra arguments to 
the basic ones developed to prove Theorem 8.2.1. 

Notes 

This chapter is motivated by the statistical applications in time series 
developed in chapter 11. 

Concerning section 8.1, there is a classical argument for the Gaus- 
sian case. Replacing C by (C + C'^)/2, there is no loss of generality in 
assuming that C is symmetric. Thus we can diagonalize the matrix, 
writing C = QDQ^ for a diagonal matrix D and an orthogonal one Q. 
Define Y = Q'^X. Since the standard Gaussian distribution is invari- 
ant under orthogonal transformation, Y has again a standard normal 
distribution. Thus {CX,X) — X]i<j<(i ^ weighted sum of 

independent chi-square random variables. This can be generalized to 
noncentered Gaussian distributions — see Imhoff (1961) — and opens 
the door for saddlepoint approximations — Barndorff- Nielsen (1990). 

The orthogonal invariance argument can be used for other ad hoc 
distributions. However, the Gaussian one is the only orthogonally 
invariant distribution with independent marginals. 
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For quadratic forms with heavy tail distribution, not much seems 
to be known. Davis and Resnick (1986) contains some asymptotic 
results as d tends to infinity in a time series context, based on the 
point process technique exposed in chapters 3-4 of Resnick (1987). 

This chapter 8 is full of open questions. To state a few, what 
happens in the degenerate cases when N{C) > 2? Can one find a 
closed formula for the integral involved in Theorem 8.2.10? What is a 
good numerical scheme to compute such integral? Can we find more 
terms and obtain an asymptotic expansion? Can one obtain good 
upper bounds instead of asymptotic equivalents? Answers to these 
last two questions would be useful in the applications developed in 
chapter 11. Can one prove conjecture 8.1.1? 

When dealing with heavy tails, there is a fashionable extension 
which consists in replacing any power function by itself times a slowly 
varying function. This is done mainly for linear functions of random 
variables. For quadratic functions, things turn to be much more 
complicated, and the classical guess, consisting of putting the same 
slowly varying function in the tail equivalent, is plain wrong. This can 
be seen already when multiplying two heavy tail random variables. 



9. Random linear forms 



In this chapter, we investigate the following problem. A d-dimensional 

random vector X = [Xi , . . . , X^) defines a random linear form 
X{p) = {X. p) on R'^. Given a snbsct M of M"* — eventually M could 
be of dimension much smaller than d — we can look at the restriction 
of the linear form on M, and at the distribution of its supremum, 

X{M) = sup{ {X,p) -.peM}. 

Writing 

At = {xeR"^ : sup {x,p) ^t} =tAi, 

we see that 

P{ X{M) ^t}= [ dP 

JtAi 

where P is the probability measure of X. Provided is not in the 
closure of Ai, the sets tAi = At are moving to infinity with t. Theorem 
5.1 may provide tail approximations of the distribution of X{M). 

There has been a tremendous amount of work on this problem, 
but from a somewhat different perspective. Traditionally, the set M 
is parametrized as Af = | p{s) : s £ for some set S and some 
function p : S C M.^ ^ M."^. When /c = 1, the random variable 
X{M) is the supremum of a linear stochastic process; for /c > 1, 
it is the supremum of a linear random field. Though it is widely 
used, parameterizing the set M has the disadvantage of hiding the 
fact that X{M) is completely parameterization free. Any surjective 
— even not smooth, not injective, not measurable, etc. — change of 
parameterization of M leaves X{M) invariant. 

Often, one is interested in the supremum norm 

\X\{M)=sup{\{X,p)\:peM}, 

more than in X(M). Up to changing M into MU (— M), it is enough 
to consider X{M). 

The key point to understand is that obtaining an approximation 
of P{ X{M) ^ t } for large t covers several distinct questions. First, 
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notice that Ai is the complement in of the convex set 

CM = {xeR'^: x{M) ^ 1 } = fl {x:{x,p)^l} 

— an intersection of half spaces. So, a first question is to understand 
how the probability that X{M) is large is related to the geometry of 
OCm- a second question is to study how the geometry of OCm is 
related to that of M. When dealing with a parameterization of M, a 
third question is how to read in the parameterization the geometric 
information we need about M. 

Among some amusing features of our point of view, we cannot resist 
mentionning the degenerate case M = { (1, . . . , 1) } C M'^. Then, one 
has X{M) = Xi + ■ ■ ■ + Xci- From our point of view, approximating 
the tail distribution of a sum of random variables, approximating the 
tail distribution of the supremum of a linear process, approximating 
the distribution of the supremum of a linear random field are all the 
same problem. 

As in the previous chapters, we will investigate light and heavy tail 
distributions. Our purpose is again to illustrate the use of Theorem 

5.1 and to show that very different geometric features govern the tail 
behavior of X{M) according to the distribution of X. 

9.1. Some results on convex sets. 

In this section we relate the set M c to the convex set Cm ■ Notice 
that Cm\jm/i = Cm- Hence, Cm does not characterize M. To study 
which part of M is characterized by Cm, we first prove that we can 
also assume M is closed. 

9.1.1. LEMMA. For any subset M of W^, the equality C^m = 
Cm holds. Moreover, Cm contains the origin if and only if M is 
unbounded. 

Proof. The inclusion of M in its closure implies that of CdM in Cm- 
On the other hand, any point p in the closure of M is the limit of a 
sequence of points pn belonging to M. Any point x in Cm satisfies 
{x,Pn) ^ 1 and consequently {x,p) ^ 1. This proves the inclusion of 
Cm in Ccim- 

Thc second statement in the lemma follows from the equivalence 
between the inclusion of the ball of radius e centered at the origin in 
Cm and that of M in the ball of radius 1/e around the origin. ■ 
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In what follows, we assume that M is closed and bounded, i.e., 

M is compact in W^. 

Another way to think of this assumption is that we can take M to 
be closed since X{M) = X{c\M). Moreover, X{M) is infinite almost 
surely if and only if M is unbounded and X is nondegenerate. In 
short, the behavior of X(M) is trivial if and only if M is unbounded. 
So, we may as well assume M to be compact. 

Recall that for a convex set C, there is a dense set in its boundary 
containing points for which the tangent space to dC is well defined — 
see, e.g., Schneider (1993). Consequently, it makes sense to define 

Mo = c\{p : there exists x e BCm , {x,p) = 1, 

TxBCm exists, and p _L T^dC } 

The next proposition shows that Mq is the smallest closed set in M 
necessary to describe BCm- 

9.1.2. PROPOSITION. Assume M is compact. 

(i) The inclusion Mq C M holds. 

(ii) Moreover, Cm = Cmq- 

(Hi) If Ml is closed in M and Cmi = Cm, then Mq c Mi. 

Proof. We will use the following claim: If x is in DCm and TxOCm 
exists, then there exists p in M, orthogonal to T^BCm-, such that 
{x,p) = 1. Indeed, for such x, there exists p in M with {x,p) = 1. 
Let u be a tangent vector to Cm at x. We can find a curve a:(e) in 
OCm such that x{e) = x + eu + o(e). Since x{e) is in Cm, we must 
have 1 ^ {x{e),p) = 1 + e{u,p) + o(e) as e tends to 0. Thus, {u,p) 
vanishes, and indeed, p is orthogonal to T^OCm- 

(i) Let g be a point in M*^, such that there exists x in OCm with 
{q,x) = 1, the tangent space T^OCm exists, and q is normal to OCm 
at X. By the above claim, there exists p in M and orthogonal to 
TxQCm such that (x,]?) = 1. Since M is compact, is in the interior 
of Cm and T^OCm is of dimension d—l. Consequently, p and q must 
be collinear. They are equal since {x,p) = {x,q) = 1. Thus q belongs 
to M. Since M is closed, the inclusion Mq C M follows. 

(ii) The inclusion of Mq in M implies that of Cm in C'mq- To obtain 
the reverse inclusion, convexity of Cmq and Cm shows that we just 
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need to prove BCmq C OCm- Assume that OCmq \ QCm contains a 
point X. Then 

^ 1 for all j9 G Mo . (9.1.1) 

Since Cm is included in Cmq-, such an x cannot belong to Cm- 
Moreover, since Cm contains the origin and x is not in Cm, there 
exists A in (0, 1) such that y = Xx belongs to OCm- For any positive 
e, there exists in OCm such that \y — ye\ ^ e and Ty^dCM exists. 
Using our claim, there exists in orthogonal to Ty^dCM and such 
that {pe,ye) = 1, i.e., Pe belongs to Mq. Since Mq is closed and is in 
the compact set M, it is compact. As e tends to 0, the points Pe 
admit a cluster point p belonging to Mq, thus belonging to M. Then, 
as lim^-vo Ve = y, 

1 = {p,y) = Mp,x) . 

Consequently, {p,x) = 1/A > 1, which contradicts (9.1.1). It follows 
that OCmq C OCm, and therefore Cmq C Cm- 

(iii) is clear from (i) and (ii), and shows the minimality of Mq. ■ 

Proposition 9.1.2 motivates the following definition. 

DEFINITION. We say that a set M is reduced if M = Mq. 

If OCm has a well defined tangent space at x, it admits a unit 
normal vector N(x) pointing outward from C. The claim in the proof 
of Proposition 9.1.2 shows that there exists p in M, orthogonal to 
TxQCm-, and such that (p, x) = 1. Such a p is collinear to N{x) and 
satisfies \p\{x,N{x)) = 1. In conclusion, 

Mq = c\{N{x)/{x,N{x)) : x € OCm , T^OCm exists} . (9.1.2) 

This set is called — traditionally assuming that OCm is smooth — 
the polar reciprocal of 9Cm — see, e.g., Schneider (1993). 

Whenever OCm is locally a C^-manifold, the following lemma 
shows that Mq is also locally a C^-manifold. Moreover, the second 
fundamental form of Mq is related to that of OCm- 

9.1.3. LEMMA. Let x : U C W^'^ ^ OCm be a local parame- 
terization of OCm- Then, xq = N o x/{x, N o x) defines a local 
parameterization of Mq. If {bi,j)i<:i jKid-i ^he the matrix of the 
second fundamental form of OCm o,t x in this parameterization, then 
bo,i,j = {{^{x) ,x)\x\) is the second fundamental form of Mq in 
the corresponding parameterization xq . 
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Proof. We follow closely Hasani and Koutroufiotis (1985). Let 
u = {ui, . . . , Ud-i) £ U. Define Xi = dx/dui and Ni = dN o x/dui. 
The function xq = N ox/{x,N ox) defines a local parameterization of 
Mo according to (9.1.2). Define f = {x, N o x). Since {xi, N ox) =0, 
we have 

dxo Ni {x, Ni) 
Moreover, x is normal to Mq at xq since 



{xo,i, ^) = ^ i^i^ ^) - Ni)j = 0. 

for all i = 1, . . . ,d — 1. Thus, the components of the second 
fundamental form of Mq are 

/dxo 1 dx\ /Ni {x,Ni) Xj\ 1 



In Lemma 9.1.3, we derived properties of Mq from knowledge on 
OCm- We will also need to go the other way, that is obtain some 
information on the boundary of Cm from the knowledge of M or Mq. 

Our next result shows that whenever Cm is bounded, an half line 
starting at the origin can cut Mq in at most one point. 

9.1.4. LEMMA. Assume that Cm is compact in W^. If q is in Mq, 
then qR+f]MQ = {q}. 

Proof. Consider a point q in Mq and A positive such that Ag 

belongs to Mq as well. Representation (9.1.2) of Mq implies that 
q = lim„^oo N{xn)/ {xn, N{xn)) and \q = lim„_^oo N{yn)/ {Vn, N{yn)) 
for points Xn,yn in <9Cm at which OCm has a well defined tangent 
plane. 

Both OCm and Sd-i are compact. Up to extracting a subsequence, 
we can assume that x^i^ Vni N{xn) and N{y n) converge respectively 
to X, J/, A^i, N2- Then q = Ni/{x,Ni) and Xq = N2/ {y-,N2) ■ Since q 
and Xq are collinear, we have A^i = 7V2 or A^i = — 7V2. Since Cm is 
convex and contains the origin, [x, Ni) and {x, N2) are nonnegative. 
The positivity of A implies A'^i = N2- 

Next, Cm being convex, it lies on one side of its tangent spaces. 
Thus [xn — Vn, N{xn)) ^ 0. Taking the limit as n tends to infinity, we 
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obtain {x—y,Ni) ^ 0. Permuting x and y yields {y—x,N2) ^ 0. Since 
Ni = N2, the vector x — y is orthogonal to A^i and {x, Ni) = {y, N2). 



As a consequence of Lemma 9.1.4, the next result asserts that 
whenever Cm is compact and Mq is a manifold, the space pU. is 
transverse to TpMg for a typical point p in Mq. 

9.1.5. LEMMA. If M is compact in R'^ and Mq is a manifold, then 
{p & Mq : pM C TpMQ } is nowhere dense in Mq. 

Proof. Assume that the set under consideration is dense in an 
open set U of Mq. Since Mq is smooth, pM. is included in TpMo 
for every p in U. Up to considering an open subset of U, we can 
assume that U does not contain the origin. Thus, p/\p\ is a unit 
vector field in the tangent bundle of Mq. Let 7 be an integral curve 
of this field, with 7(0) in U. It satisfies the equation 7' = 7/I7I. 
Consequently, 7 = I7I7'. Differentiating, we obtain 7' = 7' + I7I7". 
Hence 7" vanishes since 7 does not. Consequently, 7' is constant and 



7(5) = 7(0) + s-^j^ = 7(0) (1 + y^)- Hence the curve 7 is in the 



Making use of Lemmas 9.1.1-9.1.3 and Proposition 9.1.2, we can 
reduce M to Mq. Moreover, Mq is smooth if OCm is. In the following, 
we assume that 

M is reduced and is a smooth m-dimensional submanifold of M'^. 



As announced, our next task is to relate the differential geometric 
properties of OCm to those of M. For this purpose, we build a local 
parameterization of OCm starting from one for M. Lemma 9.1.5 
asserts that under (9.1.3), if Cm is compact, then, at a typical point 
p of Mq, the direction pM is not contained in the tangent space TpMQ. 
Actually, the proof of Lemma 9.1.5 shows slightly more; namely, that 
we cannot have pR included in TpMQ along a submanifold of Mq. We 
will assume more, even when C is not compact; namely that given pq 
in M, 

pM. is transverse to TpM for all p in a neighborhood of pq. (9.1.4) 
Consider a local parameterization 



Thus, A equals 1. 




(9.1.3) 



p : {ui,...,Um) eU 



l-> p{ui, . . .,Um) e M 
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of M around pq. We can assume without any loss of generality that 
U is an open neighborhood of the origin and that p{0) = pQ. For 
what follows, it is convenient to extend p to a map defined on a 
neighborhood of the origin of M'^~^. Thus, let F be a neighborhood 
of in R'^^^""* and consider 

p:ueU xV CR"^ X M-^-i-" ^ p{u) = p{uu ...,Um). 

Under (9.1.4), to each point p near po, we can associate a unit normal 
vector 

lyip) € {TpM +pR)QTpM; 

that is, ij{p) is normal to TpM in TpM + pR. 

Let X„i-|-i(p), . . . , be an orthonormal moving frame in 

{TpM + pR)^ = R'^ Q {TpM + pR) = R'^'"^-^. For u in U x V, 
let 

{uop{u),p{u)) 
where the function ^j^s are and such that 

{Um+l,---,Ud-l) eV<-^ l{X{ui, . . . ,Um,Um+l, ■ ■ ■ ,Ud-l)) 

is minimum at — the existence of these functions and their smooth- 
ness comes from the implicit function theorem and the smoothness of 
/. By construction, X{u) is normal to Tp^y^^M for all u in ?7 x V. 

Our next lemma asserts that if M is positively curved, then X 
defines a local parameterization of OCm- Combined with Lemma 
9.L3, it allows us to parameterize M or OCm, whichever is the most 
convenient. 

9.1.6. LEMMA. Under (9.1. 3)-(9.1. 4), if the second fundamental 

form of M relative to the normal field X is definite positive at every 
point, then X{u) defines a local parameterization ofdCM- Moreover, 
p{u) is outward normal at OCm at X{u). 

Proof. Denote by Il-^{p) the second fundamental form of M at 
p along the normal field X. Notice first that X{u) is orthogonal to 
Tp(^y-)M. Also, 

{p{u),X{u)) = l, 
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and p is orthogonal to span{ X^+i, . . . , Xd-i }. Consequently, if q{s) 
is a curve on M, parametrized by arc length, such that ^(O) = p{u) 
and q'{0) = t, then, as s tends to 0, 

{q{s),X{u)) = l+'-^{q"{0),X{u))+o{s') 

= l-yn^(p(«))(i,i)+o(s2). 

Thus, s {q{s),X{u)) is maximal and equals 1 at s = 0. This 
proves that X{u) is in OCm and that X{u) indeed defines a local 

parameterization of OCm- 

To prove that p{u) is normal to OCm, notice first that 

d 

— — X(u) = Xi(u) for i = m + 1, . . . , d — 1 . 

dui ^ ' 

Furthermore, writing Vi = -^^^ ° p{u) and Pi = ■^j-p{u), for 
i = 1, . . . ,m, 

+ XI {h'^P^ «J )d^j {P) ■ Pi ■ 

Prom the very definition of Xj, for j = m + 1, . . . , d — 1, we infer 

{Xjop[u),p{u)) = ^ for j = m + l,...,d- 1. (9.1.6) 
Differentiating this equality yields 

(dX,(p)-pj,p) + (X,op,Pj) = 0, i = l,...m, j = m+1, . . . ,d-l . 

Therefore, since Xj is orthogonal to TpM, we have 

{dXj{p)-pi,p) = ioi j = m + l,...,d-l. (9.1.7) 

Combining (9.1.5)-(9.1.7) yields {Xi,p) = for i = 1, . . . , m. Thus, 
p{u) is normal to OCm at X{u). 

Since Cm contains the origin and {p, x) = 1 is positive, the vector 
p{u) must be pointing outward OCm- ■ 

We can now explain how to compute the second fundamental form 
of OCm at X{u). It is determined by its value on the basis Xi of the 
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tangent space. Let us denote by = p/|p| the outward unit normal 
vector field to OCm- The proof of Lemma 9.1.6 shows that 

(diV-Xi,X,) = ^(p„X,) 

if z = m + 1, . . . , (i — 1 or j = m + 1, . . . , (i — 1 

+ p Sm+i^r^d-i(</'r op + Ur){pi, ^X,) otherwise 

Consequently, if denotes the second fundamental form of M 
relative to a unit normal vector field Y, we have 

-{dN-Xi,Xj) (9.L8) 

if = l,...,m. 

otherwise. 

This second fundamental form vanishes whenever i or j is not in 
{ 1, . . . , m }, expressing the fact that Cm is a ruled surface with 
nontrivial generators if m < d — 1. Along a generator of dimension 
d — 1 — m, the set Cm is flat and has vanishing curvature. 

Now, consider a convex function / on M*^. Since we are able to 
relate points and geometry of OCm to points and geometry of M, we 
should be able to relate points in OCm which minimize / to some 
speciflc points in M. For this purpose, to a function / defined on 
we associate the functions 

r{x) = snp{f{y):{x,y) = l}, 
f,{x)=M{f{y):{x,y) = l}. 

The basic properties of these transforms will be of some use and are 
stated in the following proposition. Notice that the statement (iv) in 
the Proposition does not require any smoothness. 

9.1.7. PROPOSITION. Let I be a convex function on W^, such that 

lim|2,|^oj3 /(x) = oo. Then, 

(i) I, is continuous on M"^ \ { 0} with lim[^|^o -^•(2^) = 00. Moreover, 
if I is minimal at 0, then lim^^^^^ I,{x) = /(O); 

(ii) ifI{Cl,) = I{dCM), then 

licit) = inf {l,{p):peM}= I.{M) = I, [Mo) ; 
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(even if M is not reduced) 

(Hi) points in minimizing I correspond naturally to points in Mq 
minimizing I in the sense that 

IJ {m e Mq : {m,x) = 1} 

{xeClj:I{x)=I{Cl,)} 

= {m e Mo : I,im) = I,{Mo)} ■ 

(iv) for any convex function f with its minimum at 0, the equality 
if,)' = f holds. 

Proof. (i) If {x,p) = 1, then 1 ^ \x\\p\. Consequently, 
I.{x)^mi{l{p):\p\^l/\x\}, 

and 

lim I,{x) = lim I{p) = oo . 
Moreover, if {x,p) = 1, then p = y-^j^ + Proj^xp. Therefore, 

/(O) = inf { I{p) ■.\p\eR'^}^ I,{x) ^ inf { / + 5) : g ± x } 

The equahty hm|a.|^cx) = -^(0) fohows. 

Let us now prove that /, is continuous. Let p be a nonzero vector in 
W^. Let e be positive and less than \p\. Consider a point q at distance 
e from p. Since / is continuous and blows up at infinity, there exists 
x in such that {x,p) = 1 and I{x) = I,{p). Then, 

\{x,q) - 1| = \{x,q-p)\ ^ \x\e. 

Hence, there exists y in such that {y, q) = 1 and \x — y\ ^ e|a;|/|g| 
— take y = X — [{x, q) — Therefore, 

I,{q) ^ I{y) ^ sup I I{v) : 1^; - a;| ^ j^e | . 

Since / is continuous and p is nonzero, it follows that 
limsup/,(g) ^ I{x) = I,{p) . 
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Next, consider a sequence pk in the ball B{p,e) of radius e centered 
at p, converging to p, and such that linifc^oo h{Pk) = liniinfg^p/.(g'). 
Let Xk be such that {pk,Xk) = 1 and I,{pk) = I{xk)- The function / 
is bounded on the set g/lf/p for g in B{p, e) provided e is strictly less 
than \p\. Then, the sequence I,{pk) ^ l{Pk/\Pk\^) is bounded, and 
so is the sequence /(x^). Therefore, X}^ is in a compact set /^-'^([0,c]) 
for some positive c, and admits a clustering point x. After taking 
a subsequence, we can assume that Xk converges to a; as tends to 
infinity. Since {xk,Pk) = Ij we have {x,p) = 1 and thus I,{p) ^ 
Moreover, continuity of / implies 

I,{p) ^ I{x) = lim I{xk) = lim I,{pk) = liminf . 

fc— >oo fc— »oo </— *P 

Overall, lim^^p /.((/) = I,{p) and /. is continuous on M'' \ { }. 

(ii) Let e be positive and p in M such that I,{p) ^ + e. There 
exists X in such that {x,p) = 1 and I,{p) = I{x). Since {x,p) = 1, 
the point x is not in the interior of Cm- Since / is continuous, 

HCli) = /((intCM)') ^ /(x) = /.(p) ^ /.(M) + e. 

Since e is arbitrary, wc established the inequality I{C\^) ^ I,{M). 

Next, we use the assumption /(C^) = I{dCM)- Continuity of / 
ensures that there exists x in OCm such that I{x) = I{dCM)- Hence, 
there exists p in the closure of M with (p, x) = 1 — otherwise, x 
would not be on the boundary of Cm- Therefore, ^ and 

since /, is continuous, 

/.(clM) = I.{M) ^ I,{p) ^ I{x) = I{CIj) . 

This proves I{Clj) = I,{M). Since Cm = Cmq: we also obtain 
I.{M) = /.(Mo). This proves assertion (ii). Notice also that we 
proved I,{p) = I{x). 

(iii) Let x be in OCm minimizing / over C^. Since Mq is closed, 
{x,m) = 1 for some m in Mq. For this m, if I,{m) < I{x), then there 
exists y such that {m,y) = 1 and I{y) < I{x). Given how we choose 
X, the point y is not in Cpj. Since {m,y) = 1 and Mq is reduced, 
y is in DCm-, and x does not minimize / over C\j. Consequently, 
/,(m) = I{x). Then, assertion (h) implies that /.(m) = I,{Mq). This 
proves that in statement (iii) , the set in the left hand side is included 
in that in the right hand side. 

To prove the reverse inclusion, let m in Mq minimizing /,. Since 
lim|2,|^oo /(x) = oo and / is continuous, /,(m) = /(x) for some x such 
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that {m,x) = 1. Then, I{x) = I,{Mq) and x is not in the interior of 
Cm- Then, assertion (ii) imphes I{x) = I(C1^). Consequently, the 
set in the right hand side of statement (iii) is included in that in the 
left hand side. 

(iv) If {x,p) = 1, then f.{p) ^ fix). Thus, ^ f{x). Now 

seeking a contradiction, let x be such that f{x) is positive and assume 
that < f{x). Let be a supporting hyperplanc at x of the 

level set {y : f{y) ^ f{x)}- We can write H = {e}-*- for some 
e in W^. If e is orthogonal to x, then x is in H. Since / is convex, 
f{x) ^ f{x+h) for any h in H. Taking h = —x leads /(O) ^ f{x) > 0, 
a contradiction. Thus, e is not orthogonal to x. 
Notice that 

{P ■■ {P,x) = l} = ^-^ + y:y±xY 
Moreover, if y is orthogonal to x, 

= \ ax + (1 — a)-r^ + z : a ^M., z -L span(x, y) \ . 

L l?/r 

Going back to the definition of /, and (/,)*, the assumption < 
f{x) can be rewritten: There exists e positive such that whenever y is 
orthogonal to x, the inequality f^ax + (1 — a)y\y\~'^ + z) ^ f{x) — e 
holds for some real a and z orthogonal to span(.x, y). 

Consider the following y. If e is coUinear to x, let y be any nonzero 
vector in { e}^. Otherwise, let y = aPToj^±e + bx, where a, b are such 
that 

= (y - \yfx,e) 
= |Proj3.±ep(x, e) + a (Proj^-xe, e) + + 6)(a;, e) . 

This quadratic equation in a always has a solution for |6| large enough 
as well as |6| small enough, since (x,e) is nonzero. Moreover, for |6| 
small enough but nonzero, the solution is not 0, and y is not null. 

This choice of y ensures that e is in the space spanned by x and y. 
Consequently, the orthocomplement of span(a;, y) is in H. Moreover, 
for any real a, 

ax + (1 - = X + ^. ." (y - \y^x) ex + H . 

br \y\ 



9.2. Example with a light tail 



177 



Consequently, for any real a and any z orthogonal to span(x, y), the 
point ax + {I — a)y\y\~'^ + z belongs \o x-\- H. Since / is convex, 

j{ax^ (1 -«)|^ + 2) ^ 

contradicting (/,)'(a;) < f{x). ■ 

Given Lemma 9.1.1 and Proposition 9.1.2, we can replace Mq by 
M in statement (iii) of Proposition 9.1.5. 

The explicit calculation of /, depends of course on /. Notice 
however that homogeneity is preserved, as indicated in the following 
result. 

9.1.8. LEMMA. // / is positively a-homogeneous, then I, is —a- 
homogeneous. 

Proof. The result is straightforward since 

I, (tx) = inf { I{y) ■.{tx,y) = l} = M{ I{y/t) ■.{x,y) = l} 

= t-'^I,{x) . m 

Finally, we calculate /, in an important case for applications. For 
r positive , let \x\r = (Z^i^i^d I^^D^^'^- larger than 1 this is 

the ^r-norm of x. 

9.1.9. LEMMA. If I{x) = c{\xi\'^ + ■ ■ ■ + \xd\'') , then I,{p) = c/\p\'l 
where a and P are conjugate — i.e., a"^ + /3~^ = 1. 

Proof. Without any loss of generality, we can assume that c = 1. 
If {p,x) = 1, Holder's inequality yields 1 ^ |a;|a|j3|/3, and so I{x) = 

On the other hand, for Xi = sign{pi)\pi\'°^ /\p\^, we have {x,p) = 1 
and I{x) = 1/bl^. 

In the situation described in Lemma 9.1.9, one sees that I,{M) is 
related to sup { \p\fj : p e M }, that is to the radius of the smallest 
ball in £p which contains M. 

9.2. Example with a light tail. 
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In this section, we consider a random vector X = {Xi,...,Xd) 
in M'^, having a log-concave density cxp(— /). Our first result is 
elementary. It is inspired by its Gaussian analogue, where I{x) = 
^|a;p + log(27r)'^/^. It shows that under a growth control on /, we 
can estimate the exponential decay of P{X{M) ^ t} as t tends to 
infinity. 

9.2.1. PROPOSITION. If M is compact and 

lim lim sup ^^^/.^^ ^ 1, (9.2.1) 



then 



Proof. Recall that the events {X{M) t] and {X ^ tCM} are 
equal. Let at = I{tC\j). Proposition 2.2 yields 



if and only if 



lim — log|X(M) ^t} = -1 

t^co at 



hm hm inf - log |tC^ fl | = . (9.2.2) 



Let e be positive. Since / is convex and Cm is a neighborhood of the 
origin the points minimizing / over are included in tCu for t large 
enough. Consequently, for t large enough, there exists Xt in d{tCM) 
such that I{tC%j) = I{xt). 

Let ?7 be a positive number. Consider a set of orthogonal vectors 
fi, . . . , in a supporting hyperplane of OCm at xt, such that rj/2 ^ 
\vi\ ^ 77 for alH = 1, ... ,d — 1. The supporting hyperplane is the 
orthocomplement of a unit vector e, pointing outward from OCm- We 
define = rje. Since M is compact, the interior of Cm contains the 
origin. Therefore, xt tends to infinity with t. Consequently, if rj is 
small enough, (9.2.1) implies that for t large enough 

I{xt + Vi) ^ (1 + e)I{xt) = (1 + e)at . 

Since r(x_|_g)(jj is convex, the simplex with vertices xt, Xf+Vi, 1 ^ i ^ d, 
is in tC^ Pi r(i+e)a^ . Its volume does not depend on t; it bounds the 
volume of tC|^ f] r(i_,_g)„^ below. Consequently, (9.2.2) holds. 
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It remains to prove that I{tCl^) = I,{M/t). This is clear since 
tCu = C'm/*) and Proposition 9.1.7 holds. ■ 

Condition (9.2.1) looks good, but is far from being the best that 
we can obtain. In particular, it does not cover the function I{x) = 
exp(|,T|°) with a > I. The following will do, but assumes that / is 
differentiable. 



9.2.2. PROPOSITION. In Proposition 9.2.1, one can replace as- 
sumption (9.2.1) by 

e-^n^) supui^i |D/(.x + ne-^-f(^))| 
lim lim sup ^ — — = . 

<5iO x^oo I{x) 



Notice that since I blows up at infinity, e"'^-'^^^^ is very tiny for large 
X. In essence, when d = \, the new condition asserts that I'e~^^ jl 
tends to 0. When / is large, 

I'e-^'/I ^ = 2(e-'5V2)//<5. 

Therefore, the only possible limit for I'e^^^ /I as its argument tends 
to infinity is — but in this discussion, nothing guarantees that the 
limit exists. This does not show that the new condition holds for any 
convex fTinction: but it suggests that those which do not satisfy this 
condition are rather pathological. 

Proof of Proposition 9.2.2. We follow the proof of Proposition 9.2.1. 
All what we need to do is to specify how to pick the vectors Vi, 
1 ^ i ^ d. The new assumption implies that there exists a function 
6{x) tending to at infinity, such that 

Let r] = exp ( — 6{xt)I{xt)) , and let e be an arbitrary positive number. 
The previous limit shows that 



rj sup |D/(y)| ^ e/(a;t) 
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for t large enough. With this new rj, take the Vi's exactly as in the proof 
of Proposition 9.2.1. Since \vi\ is at most rj, we have, for i = 1, . . . , d, 

I{xt + Vi) ^I{xt) + \vi\ sup |D/(y)| ^/(a;t)(l + e). 

y-\x-y\^\vi\ 

Consequently, the simplex with vertices xt, Xt + Vi, 1 ^ i ^ d lies in 
tCM n ^{i+t)i{xt)- Its volume is of order rj'^. Thus, to check (9.2.2), it 
suffices to show that 

lim — ^-^ = lim —5(xt) = . 

This is plain from the definition oi 8{-). ■ 

Clearly, one can do many variations on the theme, and get different 
conditions for the conclusion of Proposition 9.2.1 to hold. 

To obtain a sharper result than in Propositions 9.2.1 or 9.2.2, 
that is to estimate F{X(M) ^ and not its logarithm, we need 
further assumptions. Many results could be obtained under various 
hypotheses. We will suppose that the random vector X has density 
ae~^ , where 

/ is convex, a-positively homogeneous for some a > 1. (9.2.3) 

Under (9.2.3), Theorem 7.1 settles more or less the question of 
approximating 

P{tC'M) = a f e-^(^)dx. 

Of course, we need to verify the assumptions of Theorem 7.1. Since 
Cm is convex, assumption (7.5) is always satisfied, while (7.1) is 
guaranteed by the boundedness of M. Thus, only (7.3) and (7.4) 
are left to check. 

It does not seem that we can work out a general theory much 
further. But let us show how we can obtain a tail equivalent for 
P{X{M) ^ t} from Theorem 7.1. 

Proposition 9.2.1 and Lemma 9.1.8 imply 

hm 1 logP{ X{M) ^t} = -I,{M) 

and so we are done as far as the exponential term is concerned. 
Reading the formula in Theorem 7.1, we have I{Ai) = I,(M). 
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We then need to relate the differential geometric quantities involved 
in Theorem 7.1 to those of M. From section 9.1, we infer that we can 
reduce M to Mq. So, let us assume that 

Mo is a closed, connected, m-dimensional submanifold of W^. 

Since we can replace M by Mq, we will drop the subscript and write 
M instead of Mq until the end of this section. Consider a local 
parameterization p{u) of M. Wc have seen in section 9.1 that it 
induces a parameterization X{u) on OCm- Assume that 

V,,M = {peM ■.I.{p) = I.{M)} 

is a /c-dimensional submanifold of M. 

We can choose the parameterization such that 

{ui,...,Uk) eU' C R'^ ^p(ui,...,Ufc,0,...,0) G P.,M 

is a local parameterization of T>,^m- With the notation of section 9.1, 
it follows from Lemma 9.1.6 and Proposition 9.1.7 that 

{ui,...,uk) eU' X{ui,.. .,uk,0,...,0)e VcIj 

is a parameterization of T^ci^ ■ To compute the Riemannian measure 
M-Vnc , we first compute the first fundamental form of the surface 
■ We obtain 

1/2 

dMvcc {X{u)) = (det{{Xi{u) , Xj{u))) ) dui A • • • A d^Xfe , 

where Xi{u) is given in (9.1.5). The first fundamental form of OCm 
involves inner products of v, Ui, Xi, dXj -pi, which can be expressed in 

term of the third fundamental form of M — or analogously, in term of 
the connection form on its normal frame principal bundle. In general, 
such an expression is rather involved; we will see how it simplifies in 
some cases. 

To compute the curvature term Gc in Theorem 7.1, the comment 
after the statement of Theorem 7. 1 shows that it equals 

G{x) = nX^^^^j,^; - '^dCM,x 

for aU X in T>ci^. Since G{x) = Uaj^^^^^^^ - '^dCM,x vanishes on 
directions tangent to I^cs, ) one way to compute det G{x) is actually to 
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diagonalizc G{x), and take the product of its positive eigenvalues — 
by construction, the eigenvalues of G{x) are nonnegative; they are null 
only on the eigensubspace TxdClj Q T^Vcy. Ultimately, we need to 
calculate det(G(a;) — Aid). For this purpose, consider an orthonormal 
basis ei, . . . , ea-i of T^dlCli) = T^ki^cij)- 

Denote by X the matrix obtained in writing the vectors Xi, . . . , 
Xd-i in the basis ei,...,ed_i, that is X = {{ei, Xj))-^^-^^^^. This 
matrix is nonsingular and 



det(G(x) - Aid) = ^ det(;f'^G(a;);f - XX^X) = 

^ det{{X^X)-^X^G{x)X - Aid) = . 

How do wc compute X'^GX and {X'^X)-'^7 

Write Nqcm the outward unit normal to BCm- Since 

we first compute (D^/(x)Xj, Xj), 1 ^ i,j ^ d — 1. The terms 
{dNgcj^Xi,Xj), 1 ^ i,j ^ d — 1 can be computed with formula 
(9T.8) and depend on the curvature of M via its second fundamental 
forms UIj and 

The first fundamental form X^X = {{Xi, Xj))i^ij^n-i can be 
computed in the same way. One may notice however that for i = 
1, . . . ,m and j = m + 1, . . . ,d — 1, 

{Xi, Xj) = i^^^ + dct>j{p) ■Pi+ Yl Md^r ■ Pi , Xj) . 

Moreover, if i, j = m + 1, . . . , d — 1, then 

{Xi,Xj) = 5ij . (Kronecker symbol) 

Finally, if i, j = 1, 2, . . . , m, the expression of {Xi,Xj) involves again 
the third fundamental form of M. 

At this point, it does not seem possible to push the abstract 
calculation much further. The author hopes that it is clear that the 

tail behavior of X{M) is governed by the differential geometry of M 
immersed in M"', a somewhat known fact. All the calculations can be 
implemented on a computer. 
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Some simplifications may occur in some specific cases. We now 

discuss some important ones. 

Assume for instance that / is a radial function, namely that 

I{x) = j(|a;p) for a function r € [0, oo) J{V^) (Q 2 A) 
which is convex on M+, and increasing. 

Under (9.2.4), / is minimal if and only if \x\ is. Thus, the normal- 
ization minimum at Um+i = • • • = Ud-i = forces (f)j = 0. 
Moreover, at the minimum, i.e., on I'c^, the norm is mini- 
mum, and therefore, 

X±Xi, z = l,2,...,d-lonPcs,- 

Since z/j is orthogonal to Xi for i = 1,2, . . . , d — 1, the vectors X and 
arc collincar on T^c\j- Consequently, X, v, p are collinear on T^Cf^, 

and all point outward A/(-(^c^). 

Notice also that \u\ = 1 forces {u, Ui) = {p , Ui) = 0. 
Equation (9.1.5) becomes very simple then, namely 

Xi = - — ^— = 7^ on Vc": and for i = 1, 2, . . . , m . 
{i^,p} \p\ ^ 

It follows that along I'c^ , the first fundamental form of OCm is given 

by 

{\p\~^{ui,iyj) , i,j = 1,2, ...,m. 

\p\-^{ui,Xj) , i = 1,2, . . . ,m, J = m-l- 1,. . . - 1. 

Sij , i,j = in + l,...,d-l. 

The second fundamental form of OCm also undergoes some simplifi- 
cations. Indeed equation (9.1.8) becomes 



{dNac^Xi,Xj) 



if i V J = m-h 1, . . . 1. 

-\p\~^{Pi,^j) if i,i = 1,2, ...,m. 

(9.2.5) 

— remember that the normalization Um+i = ■ ■ ■ = u^-i = at the 
minimum forces (/>j = on T^ci^ ■ 

REMARK. Be careful when using (9.2.5). The X/s, for j = 
m + I, . . . ,d — 1 have unit norm. But for j = 1, . . . , m, the norm of 
Xj may not be 1. If one wants an expression of the fundamental form 
in a basis of unit vectors, one should divide Xj by its norm in (9.2.5). 
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Nothing guarantees that the Xj^s for j = 1, . . . , m are orthogonal to 
the Xj^s for j = m+ 1, . . . , d— 1. Thus to use (9.2.5) in computations, 
one needs to use some form of orthogonahzation technique to obtain 
the matrix dNgcj^ in an orthonormal basis. 

REMARK. If we rescale M to AM, then Cm becomes Cxm = Cm /A. 
So, the second fundamental form of Cxm should be proportional to 
A. Consequently, the expression on the right hand side of (9.2.5) is 

homogeneous in A. With our choice, the parameterization of C\m is 
Xx{u) = X{u)/ X. ThcYeforc Xx^i = Xi/ \. For i, j = 1, 2, . . . , m, the 
left hand side of (9.2.5) written for Cxm reads 

{dNec,^Xx,i,Xxj) = (diVac.My > y ) • 
Thus, for CtM, formula (9.2.5) is 

{dNQc,^X„X,) = X{p,,N,)/\p\\ 

This is indeed homogeneous in A since Xi and Xj do not depend on 
A. 

Similarly to what happens for the second fundamental form of OCm 
along T^cIj j the second fundamental form of the level sets Aj^qc ^ 
undergoes a great simplification when / is a radial function. Since the 
level sets are spheres, their second fundamental form at X is 

nA^^^^jX)iX,,X,) = \X\-\X,,X,) = \p\{X,,X,) on Vci^ 

— recall that for a radial /, we proved that X{u) is collinear to p{u), 
and so 1 = {X{u),p{u)) = \X{u)\ \p{u)\. 

Further simplifications may occur by a good choice of the parame- 
terizations. For example, it may happen that 2^c^ is is parametrized 
by {ui, . . . , Uk, 0, . . . , 0), where k = dimPc'c^. In the case dim Af = 1, 
we can take X^ to be collinear to the torsion vector of the curve M . 
This ensures that = v' is orthogonal to X4, . . . ^X^-i. The case 
m = d — 1 \s also rather specific. There is a vast number of possible 
specializations where more or less remarkable formulas can be ob- 
tained. However, it is not obvious that such extensive developments 
would bring more insight. They may be worthwhile for some specific 
applications and we will see some in chapters 10-12. 

To conclude, we mention that when Theorem 7.5 applies, it shows 
that the distribution of X/t given X{M) larger than t converges to 
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a distribution supported by T>ci^ = { x G OCm '■ lix) = I{C%[) } as 
t tends to infinity. To a point x in I'cj^ eorrespond points m in Mq 
such that {x,m) = 1. Proposition 9.1.7.iii shows that as x varies in 
T^Cii, these m's vary among the points in Mq minimizing /,. This 
imphes that when M is closed, {me Mq : X{m) = X{M) } given 
X{M) ^ t is a random closed set whose distribution given X{M) ^ t 
tends to be concentrated on points in Mq that minimize /,. When 
X{M) is achieved at a unique point m{X) in Mq, Theorem 7.5 even 
gives the limiting conditional distribution of argmax^gMo X{m) given 
X[M) ^ t, as t tends to infinity. It is the image measure by m of the 
limiting conditional distribution of X/t given X £ tCj^; this latter 
limiting distribution is given in Theorem 7.5. In particular, if /, is 
minimum at a unique point m* of Mq, then argmaXp^M X{p) given 
X{M) ^ t converges in probability to m* as t tends to infinity.. 

9.3. Example with heavy tail distribution. 

In this section we consider a random vector X = {Xi, . . . ,Xd) inW^ 
with independent coefficients, all having a Student-like distribution 
with parameter a. Thus, the distribution of Xi is absolutely continu- 
ous with respect to the Lebesgue measure, and satisfies 

P{Xi^-x}^ P{Xi^x}^ Ks,aa^''-^^/^x-'' 

as X tends to infinity, for some constant Kg 

The estimation of P{ X ^ C} for an arbitrary convex set C 
containing a neighborhood of the origin turns out to be amazingly 
simple. Recall that (ei, . . . , e^) denotes the canonical basis of W^. It 
is convenient to introduce the following terminology. 

DEFINITION. Let S be a set in M.^, containing a neighborhood of the 
origin. A point of the form Ae^ on dS is called an axial point of dS. 

As their name suggests, axial points of dC are points on dC which 
lie on a canonical axis. Notice that if C is not the whole space M!^, 
then dC has at least one axial point. Moreover, dC has at most 2d 
axial points. These lower and upper bound can be achieved. For 
instance, the half space {x £ M"^ : ^ 1 } has a unique axial point, 
ei; and the centered unit ball has 2d of them, namely plus or minus 
the vectors of the canonical basis. 
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9.3.1. THEOREM. Let C be a convex neighborhood of in W^. 
Assume that C is not the whole space . Let X be a random vector 
with independent and identically distributed components having a 
Student-like distribution Sa, with parameter a, and such that S'q.(O) = 
1/2. Then 



oo . 



P{X ^tC}^ W as t 

a 

where the sum is taken over all the axial points of dC . 

Proof. Notice first that if dC is not smooth at some of its 
axial points, say ai,...,ak, then we can sandwich C between two 
smooth convex sets with axial points (1 — e)ai, . . . , (1 — e)ak and 
(1 + e)ai, . . . , (1 + e)afc respectively. For these approximating convex 
sets, the asymptotic formula in Theorem 9.3.1 can be proved assuming 
that dC is smooth. We then let e tend to 0. So, there is no loss of 
generality in assuming that dC is smooth, which we do from now until 
the end of the proof. 

The proof of Theorem 9.3.1 is then essentially the same as that 
of Theorems 8.2.1 and 8.2.10. It is hoped that the reader will be 
convinced that our unifying formalism is actually quite convenient, 
even though, one more time, some specific examples could be treated 
more easily with ad hoc methods. 

Let Sa denote the density of a single Xi. To prove Theorem 9.3.1, 
we need to approximate the integral 

P{X ^tC] = / Sa{xi) . . .Sa{Xd)dxi. . .dXd 

J'R'^\tC 

for large t. As in the proof of Theorem 8.2.1, extend the function 
o Sa to by considering it acting componentwise on each 
coordinate. Making the change of variable Y = ^'~o Sa{X) leaves us 
to approximate 

e" 



This leads us to consider the convex function 

|2 



y 

and the sets 



/(y) = ^-log(27r)'^/2 



At = tC" , and Bt = <!>^o 5a (tC" 
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Our first proposition hereafter evaluates I{Bt) and locates the 
points of interests in Bt as far as minimizing the function / is 
concerned. To this aim, define 

7 = min { |a| : a axial point of dC } . 

9.3.2. PROPOSITION. As t tends to infinity, we have 

I{Bt) = alogt - ^loglogt + alog7 

- log{Ks,^a-/^2V^) + log(27r)<^/2 + o(l) . 

// a is an axial point of dC, then 

TBt ($^o Sa{ta)) = a log (|a|/7) + o(l) as t ^ oo . 

For any positive numher Mi, the set dBt H i[Bt)+Mi\og\o^t ^^^-^ in 
an 0(\/log logt) -neighborhood of the points Sa{ta), where a is an 
axial point ofdC. Consequently, there exists a positive such that 
this set lies in the image through $""0 Sa of a (logt)^^ -neighborhood 
of the axial points oftC. 

Proof. An easy application of the expansion for ($*~o Sa)"^ given in 
Lemma A. 1.5 gives for any axial point a of dC, 

l{^^oSa{ta)) =alogt- ^loglogi + alog|a| -log(X,,„a"/22V^) 

+ log(27r)<^/2 + 0(1) 

as t tends to infinity. 

Let Ml be a positive real number. Consider a point u in C"^ such 
that 

I o Sa{tu)) ^ alogt + Ml loglogt. (9.3.1) 
Consider rj < 1 — (l/\/2). For t large enough, the inequality 

l{^'-oSa{tu))^a{l-rj) J2 h-^,oo){\ui\) log {t\ui\) 

^ a{l - r]fti{ l^i^d: \ui\ ^ i"" } logt 
implies that for u as considered, 

tt{ 1 s; i ^ d : \u^\ ^t-'^j^il- ry)-2(i + o(l)) < 2 . 



188 



Chapter 9. Random linear forms 



However, since u is not in C, we must have 

for t large enough. Consequently, u has exactly one coordinate larger 
than t"'^. Thus, it must be in a ^^''-neighborhood of the canonical 
axes in M'^. Such a point is of the form u = \a + v for some axial point 
a of DC, some A ^ 1 — t~''|a|~^ and v orthogonal to a with \v\ ^ t~'^. 
Consequently, 

7($^o Saitu)) = h'-o Sa{tX\a\)^ + ^1* Sa{tv)\^ + log(27r)'^/2 . 

This expression is asymptotically minimum for t; = and A = 1 and 
\a\ minimum, i.e., \a\ = 7. Combined with Lemma A. 1.5, this gives 
the value of I{Bt) up to o(l) as t tends to infinity. Then, the value 
for tbj($^o Sa{ta)^ follows from Lemma A. 1.5 as well. 
Still assuming (9.3.1), we must have 

a log i + Ml log log t ^ o Sa{t\a\{l + o{l))y 

+ ]- max $^0 Sc,{tVif + log(2^)^/2 

Hence, using Lemma A. 1.5 to approximate Sa (t\a\ (l + o{l] 
max $^0 Sa{tvif ^ (Ml + 1) log log i 

for t large enoug h, that is $^0 Sa{tu) = 0{\og\ogtf/'^ . Therefore, 
^*~oSa{tv) is indeed in an 0(log log t) ^/^-neighborhood of ^"^oSaita). 

Given the expression for $"^0 Sa in Lemma A. 1.5, we also must 
have 

max a log {t\vi\) ^ (Mi + 1) log log t , 

that is, maxi^i^rf ^ {logt)^^'^'^^^/'^ for t large enough, which is 
the last statement of the proposition. ■ 

We can now try to calculate the asymptotic equivalent given by 
Theorem 5.1. Given the proof of Proposition 9.3.2, it is natural to try 
the projection of the axial points of dBt onto ^i(Bt) '^^ ^ dominating 
manifold. So, let pt be the radius of the ball A/^^^), and set 

I^Bt = I Pt-^—r-^rrrTJ ■ « ^^i^^ Poi^* of 9C ) . 

L \^^oSa{ta)\ ) 
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Notice that because we assumed S'a(O) = 1/2, the equahty $^ o 
Sa{sei) = c&^o Sa{s)ei holds for any canonical vector Cj of M!^ and 
any real number s. Equivalently, we have 

= I P^^Y^ '■ ^ axial point of dC | . 

The dimension oiVst is A; = 0. 

From the values of I{Bt) in Proposition 9.3.2, we infer that 

pt = [2alogt-loglogt + 2alog7-21og(XaQ;''''^2V^) +0(1)]^'^^ 

~ V' 2a log t as i ^ oo . 

Putting all the pieces together, and assuming that we can verify its 
assumptions, the approximation formula in Theorem 5.1 yields. 



cxp - a log (|a|/7) 



«°/^V2 ^ 1 



■(2«logt)('^+i)/4^|,|„(dctGB.Ua/|a|))^/'' 



as t tends to infinity, with the sum taken over all the axial points a 
of dC. So, it remains for us to calculate det Gsj(/9ta/|a|) for all the 
axial points of dC, and check the assumptions of Theorem 5.1. Our 
next lemma does half of the task. 

9.3.3. LEMMA. For any axial point a ofdC, 

GBtipta/\a\) ^ ast^oo. 

V2q; log t 

Proof. Let a be an axial point of dC and let u be orthogonal to 
a. Since the origin is in the interior of the convex set C, the line 
oR intersects dC transversally. Therefore, we can parameterize dC 
around a by a ball in {a}"*", centered at 0. In other words, there 
exists a smooth function ha : {a}^ ^ M. such that 

Pa ■■ u e {a}^ 1-^ Pa{u) = (l + ha{u))a + uedC 

defines a parameterization of dC around a — i.e., for 1^1 small enough, 
it is a parameterization. Since u is orthogonal to a and o Sa acts 
componentwise, 

Sc,{tpa{u)) = Sa(t{l + + $"-0 Sa{tu) . 
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If /($^o Sa{tpa)) ^ liBt) + MiVloglogt — the domain which 
will interest us after we choose cb^ — Proposition 9.3.2 asserts 
that |u| ^ (logt)^^/t for some M2. Since dC is smooth, we have 
\h{u)\ ^ Mslul ^ Ms{logt)'^^'^ /t for some positive M3 and in this 
range of u. Consequently 

Sa(^t{l + h{u))aj = $^0 Sa{ta) + o(l) as t ^ 00 ; 
and in the range l^l ^ {logt)^'^/t, 

$^0 Sa{tpa{u)) = $^0 Sa{ta) + $^0 Sa{tu) + o(l) (9.3.2) 

as t tends to infinity. Up to the term in o(l), this last equation defines 
a plane orthogonal to $^o5'a(ta). Followingtheproof of Lemma 8.2.8, 
it follows that 



pt V2a logi 



Lemma 9.3.3 implies 

detGBt{pta/\a\y^^ (2alog^)-('^-^)/^ as t ^ 00 . 

With the estimate of P{At) obtained before the statement of Lemma 
9.3.3, we obtain 

fC ^(a-l)/2 

P{At) '-^^ ast^oo. (9.3.3) 

a 

This is the asymptotic equivalent given in Theorem 9.3.1. Thus, it 
remains for us to check the assumptions of Theorem 5.L Taking the 
risk of making the rest of the proof boring, we will do it in a systematic 
way, showing that this is a rather easy task. 

Our candidate for CBt is q = (d + 1) log log t. Indeed, combining 
Propositions 2.1 and 9.3.2, we obtain 

L{l{Bt) + ct)^ coe-^(^*)-^* (1 + I{Bt) + q)" 
= o{t^") as t ^ 00 . 

Given (9.3.3), this choice of q guarantees that (5.4) holds. 

Given Proposition 9.3.2 and the way we constructed Vb^ — using 
a projection on the sphere of radius pt — the set B_^^j^ lies on 
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0(loglogt)-neighborhood of Vst on Aj^^^t)- Since the radius of 
injectivity of A/j-^^) — a sphere of radius pt ~ ^/2a logt — is Trpt/2, 
assumption (5.2) holds for t large enough. 

Assumption (5.5) is almost plain. Equation (9.3.2) is the analogue 
of Lemma 8.2.2 or 8.2.12. It shows that the boundary dBf near an 
axis point Sa{ta) is a plane orthogonal to a up to an o(l)-tcrm. 
The very same argument as that used in the proof of Theorems 8.2.1 
and 8.2.10 ensures that (5.5) is verified here. 

(5.6) is clear. 

(5.7) follows in the very same way as in the proof of Theorem 8.2.1. 
We stih have to,M(p) = O(loglogt) while i^max(^, ^o) = 0{logt)~^ for 

(5.8) is plain, for a sphere has positive Ricci curvature in R^. 

(5.9) , (5.10) and (5.11) follow exactly as in the proof of Theorem 
8.2.1. 

(5.12) and (5.13) are plain as well given the proof of Theorem 8.2.1 
or 8.2.10, and this concludes the proof of Theorem 9.3.1. ■ 

REMARK. Note that if wc drop the assumption 5*0(0) = 1/2 in 
Theorem 9.3.1, that is Sa of median zero, axial points are not mapped 
anymore to axial points by ^'^o Sa- The argument we developed 
would still be valid though, since the asymptotic expansion for ^^oSq. 
shows that the component of <I>*~o Sa{tei) on {ej}^ is asymptotically 
negligible compared to that on e^M. Thus, the conclusion of Theorem 
9.3.1 still holds true without assuming S'„(0) = 1/2. 

From Theorem 9.3.1, we deduce the following limiting behavior of 
the conditional distribution of X given X tC. 

9.3.4. COROLLARY. Let C be any convex neighborhood of in W^, 
such that C ^ M."^. If X is a random vector with independent and 
identically distributed components having a Student-like distribution 
with parameter a, then the distribution ofX/t given X ^ tC converges 
weakly* to 

5;la|-«p„/EH"^ 

a a 

where the sums are taken over all axial points a of dC and Pa is a 
Pareto distribution concentrated on aR"*", whose cumulative distribu- 
tion function is given by 

Pa{{X,a/\a\) ^\a\ + \}= , A^O. 
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Proof. Let A be a positive number. Consider the convex set 

D = { X : (x, a/|a|) ^ |a| + A } . 

The set of all axial points of d{D^ n C^) is just { o J^y }, and a is 
an axial point of the convex set C. Consequently, applying Theorem 
9.3.1 with the convex set C and {D'^ n C^Y yields 

P{X ^ tD\X ^tC} = P{X e tiD" n } /P{ tC } 
-{\a\ + Xr/J2\ar. 

a 

This is the result, for the conditional distribution of X/t given X ^ tD 
converges trivially to Pa. ■ 

Let us now go back to the study of processes of the form {X, p) , for 
p in a set M of W^. Theorem 9.3.1 gives us the asymptotic behavior 

of P| X(M) ^ 1 1 as t tends to infinity. To obtain a more readable 
statement, we need to express the axial points of OCm in term of M. 
This is done in the next result. 

9.3.5. PROPOSITION. The set of all axial points of OCm coincide 
with the set of vectors ea^^iCi where e is in { —1, +1 }, and i is such 
that e{p, Si) > for some p belonging to M , and 

l/ae,i = sup { e (p, ei) : p G M } . 



Proof. Let a be an axial point of OCm- Necessarily a = e\a\ei 
for some e in { — 1, 1 }, and Cj a vector of the canonical basis of 
R"". Since a is in OCm, we have {a,p) ^ 1 for all p in M, and 
sup [{a,p) : p e = 1. Thus, sup { e\a\{ei,p) : p G M } = 1 and 
the result follows. ■ 

If we have a parameterization f{t) = . . . , /^(s)) of M 

indexed by s in some set S, it is particularly easy to relate the behavior 
of f{s) to the geometry of the set M captured in Proposition 9.3.5. 
This yields immediately the following result, where the reader will 
notice that the function / is completely arbitrary — no need for 
measur ability, or any kind of regularity whatsoever! 
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9.3.6. THEOREM. Let S be a set, and f{s) = (/i(s), . . . , 

be a bounded function defined on S. Let X be a random vector in 
W^, with independent and identically distributed components having a 
Student-like distribution with parameter a. Let 

X{S)=supXifi{s) + ---+Xdfdis). 
ses 

Then, 

P{X{S)^t}^Ks,a^-^ cl, ast^<x, 

£€{-1,1} 

where 

Ce,i = sup{ (e/i(s))_^ :seS}, e e {-1,1} , i = 1, . . . ,d . 

Another way to interpret c^^j is in looking at the projection of M 
on the i-th canonical axis ejM. The value c^^i is if this projection is 
concentrated on the set — ecjlR ; otherwise c^^j is the coordinate of the 
largest point of this projection. 

It is quite amusing to notice the following. Set /(s) = (1, . . . , 1) for 
all s. Then X{S) = Xi + ■ ■ ■ + Xa. Theorem 9.3.6 imphes that 

P{X, + ... + Xd^t}^^^^^^d. 
Using again Theorem 9.3.6 for d = 1, we then infer 

p{x, + ■ ■ ■ + Xd^ t} d{i - Sa{t)) = dP{x, t} . 

This is a known asymptotic identity showing that the Student-like 
distributions are subexponential! 

Notes 

This chapter is connected with a huge literature. To proceed in order, 
I first cannot quite believe that the transform / /, and its use are 
new. It allows one to read the minimization of a convex function on 
the complement of a convex set over its polar reciprocal. However, 
I have not found it in the literature. Similarly to what is called the 
infimum convolution in convex analysis, it would be natural to call 
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7, the infimum Radon transform of /. The relation (/.)* = 7 in 
Proposition 9.1.7.iv is an inversion formula for this infimum Radon 
transform. 

Lemma 9.1.3 is essentially contained in Hassanis and Koutroufiotis 
(1985). Though I refer to Schneider (1993) for convexity theory, the 
differential viewpoint in section 9.1 is closer to Bruce and Giblin 
(1992). The polar reciprocal is sometimes called the pedal surface. 
It would be desirable to connect further the global properties of Mq 
and Cm- 

Proposition 9.2. 1 is a generalization of the finite dimensional version 
of Fcrniquc (1970) and Landau and Slicpp (1970). There are many 
proofs in the Gaussian setting, and the lectures by Ledoux (1996) 
are most illuminating. The current literature on related problems is 
connected with notions such as concentration of measure and a set of 
inequalities: isoperimetric, Sobolev logarithmic, Poincare. A couple 
of pointers to this literature arc Talagrand (1995) and Bobkov and 
Ledoux (1997, 2000). However, it is not quite clear that Proposition 
9.2.1 can be recovered from the existing results in the literature. An 
open question is if the conclusion of Proposition 9.2.1 holds for any 
convex functions. 

The result of Proposition 9.2.1 also makes sense in infinite dimen- 
sion, using of course the dual space to define 7, . But the proof given 
here breaks down in infinite dimensions. 

The part of section 9.2 following Proposition 9.2.1 is connected 
with a flourishing literature on the Gaussian case and a few related 
distributions such as the chi-square. The point of view given here 
is close to an abstraction of Diebolt and Posse (1996). A radically 
different line of investigation is in Piterbarg (1996). A most interesting 
survey of the literature on supremum of Gaussian processes is in Adler 
(2000). 

In the heavy tail case. Theorem 9.3.6 seems to be part of the 
folklore; but I have not found it in the literature. Both Theorems 
9.3.1 and 9.3.6 can be proved directly by ad hoc methods. 

Concerning sums of heavy tailed random variables, Bingham, 
Goldie and Teugels (1987) contains invaluable material. I learned 
about those things in part in Broniatowski and Fuchs (1995). Much 
nicer results than the one presented in this section exist, including sec- 
ond and higher order formulas. But, unfortunately, the accuracy of 
these expansions is incredibly poor, especially when the tail parameter 
a is large. 
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It is interesting to rework the proofs of this specific chapter as- 
suming that the cumulative distribution function of Xj is 1 — F{x) = 
x~°'i{x) for some slowly varying function £ at infinity. Linearity of the 
functional considered here leads to neat results — but they depend 
heavily on the linearity! 
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In this chapter we consider a matrix X = (-^i,j)i<j j<(j with random 
coefficients that are independent and have the same distribution. 
Many qTiantities associated to X arc of interest. For instance, its trace 
tr(X), its determinant det{X), or its operator norm ||-^||. All these 
quantities have in general complicated distributions which cannot be 
calculated very explicitly. Hence, it makes sense to investigate their 
tail behavior. 

Before going further, let us mention that the trace of X is noth- 
ing but a sum of independent and identically distributed random 
variables. Results from section 9 give the tail approximations for 
P{ tr(X) ^ t } when the coefficients of M have a Weibull or a Student- 
like distributions. Therefore, in this section, we will concentrate on 
the determinant and on the norm of X. We will see that their tail 
behavior often turns to be quite interesting, if not fascinating. 

Throughout this chapter, it will be convenient to think of matrices 

2 

as vectors in as well as linear operators acting on M". In 

■ ■ 2 

particular, we denote by E^'^ the canonical orthonormal basis of M" 
viewed as matrices. Thus, E^'^ denotes the matrix with 1 on the 
(i, J ) -entry, and elsewhere. In other words. 

where 6u,v is the Kronecker symbol. 

Let M(n, M) denote the set of all nx n matrices with real coefficients. 
Also, we write GL(n,]R) for the group of all invertible matrices in 
M(n, R), that is the hnear group. 

Since our method is differential geometric, we will need the differ- 
ential and Hessian of the determinant as a map from GL(n, M) to M . 
For the sake of completeness, we recall them. 

10.0.1. LEMMA. For all x in GL(n, R) and h, k in M(n, R), 
Ddet{x)h = det(a;)tr(a;~^/i) , 

and 



D^det(a;)(/i,fc) = det(a;)tr(a:"^/i)tr(a;"^A;) - det{x)tT{x~'^hx~'^k) 
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Proof. Let x be an invertible matrix. Since det(a; + h) = 
det(x)det(Id+a;~^/i), it is enough to calculate Ddet(Id) and D^det(Id). 

Let &n denote the group of permutations of n elements. Define 
&^ = {Id} C ©n and let 6^ be the subset of ©„ made of all 
transpositions. The signature of a permutation a is e{a) = +1 
(resp. —1) if a is the composition of an even (resp. odd) number 
of transpositions. Wc have 

det(Id + sh)= e(cj) {5i^^^i) + shi^^^ij) . 

This sum over 6„ can be decomposed as a sum over 6° , plus one over 
plus a remainder term. The sum over ©° has a unique term, 

e(Id) Y[ i6i,i + shi,i) 

= i + s Y Ki + s^ Y K^hj + o{s^) 

2 

= l + str(/j) + i-( hihjj- Y hl)+0{s^) 

= l + .tr(/i) + ^(tr(/i)2- hl)+0{s^). 
Next, we also obtain 

Y n ikT{i) + Shi,aii)) 

= - XI n i'^ + shk,k)s'^hijhj^i 
= Y Kjhj,i + 0{s^) 

= -y(M/^^)- Y hh)+o{s'). 

If a is in ©„ \ U ©^), at least 3 integers in { 1, 2, . . . , n } are not 
invariant under a. For such permutation 
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It follows that 

det(Id + sh) = l + s tr(/i) + — {ti{hf - ti{h^)) + O(s^) 

as s tends to 0. Consequently, for x in GL(n,M), 

Ddet(x)(/i) = det(x)Ddet(Id)(x-^/i) = det(x)tr(x-^/i) . 
Also, we have 

D2det(Id)(/i, h) = tr{hf - tr{h^) 
and by polarization 

D2det(Id)(/i, k) = tr{h)tr{k) - ti{hk) . 

Consequently, 

D'^det{x){h, k) = det{x)D'^det{ld){x-^h,x-^k) 

= det(a;)(tr(a;~^/i)tr(a:~^A;) - tT{x-^hx-^k)) 

as claimed. ■ 

We conclude this section by a trivial but useful formula. When 
needed, we write (•, •)]^„2 for the inner product in the Euclidean 
space M"^. We use the tensor product notation, E'^'^ (g) E'-"* to 
denote the bilinear form x,y E M(n,]R) i-^^ E^'^ (g) E^''^{x,y) = 

10.0.2. LEMMA. On the basis E'^'^ , the bilinear map {h,k) e 
M(n, M) tT{x~^hx~^k) has the form 

J2 {x-')raA^-')j,lE'''®E'^"'. 
l^i,j,l,m^n 

Proof. It is straightforward, 

tT{x~^hx~^k) = {X~\jhj^i{x'^)i^rnkm,i 

= E ix-\j{x-\mE'''^E^'\h,k). - 
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Viewing M(n, R) as , we have a natural inner product 

x,ye M(n, M) ^ {x, y) = ^ Xijyij = tT{x'^y) = tT{xy'^) . 

Consequently, viewing Ddct(.T) in , Lemma 10.0.1 implies that 

Ddet(a;) = det(x)(a;-^)^ € . (10.0.1) 

10.1. Random determinants, light tails. 

In this section, we consider a random matrix X = {Xi,j)i<:i j<:n where 
the Xij are independent and identically distributed, each having a 
symmetric Weibull-like density 

q,1-(1/q;) ._\n\a. 

One of the aim of this section is to show that the Gaussian case, 
obtained for a = 2, is rather specific. The main reason is of course the 
invariance of the Gaussian distribution under the special orthogonal 
group. 

Given the densities Wa of interest, let us define 

I(x) = - Yj I^'.^'I'* ' ^ ^ M(n,R) = M"' . 

Furthermore, define 

At = {xe M(n,R) : detX ^t}= t^'^'Ai . 

Since / is homogeneous, we can use Theorem 7.1 in order to 
approximate 

P{detX^t}= — — - / e-^Wdx. 

V2r(i/a)y jtvnAi 

This requires us to compute the dominating manifold I'ai- Unfor- 
tunately, I have not been able to do so in general. The following 
result will rely on an explicit calculation in some special cases, and a 
conjecture in general. 
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Our first lemma provides a neeessary condition for a matrix to be 
in Vai- We denote by SL(n,]R) the special linear group on R", that 
is the group of all matrices of determinant 1. 

10.1.1. LEMMA. Ifx is annxn real matrix minimAzing I{x) subject 
to the constraint dctx ^ 1, then x is of determinant 1, that is belong 
to SL(n, M). Moreover, for such a matrix, 

= ^sign(a:j,i)|a;j,i|"~^ , l^i,j ^n, (10.1.1) 

where nX = min { I{x) : deta; = 1 }• 

Proof. Since det(Aa;) = A^deta; and /(Ax) = |A|"/(x), we clearly 
have detx = 1 at the constrained minimum. So, we need to find 
inf { I{x) : det x = 1 } . At the minimum, the normal vector of the 
level set of / and {x : detx = 1} are collinear. Using (10.0.1), this 
condition writes 



(\sign{xij)\xij\'^ ^) ={x 



for some nonzero A. 

Since 1 = {x~^x)i^i for alH = 1, 2, . . . , n, we obtain 

"= 5Z i^~^)i,k^k,i = J X] sign{xk,i)\xk,i\°"'^Xk,i 
A ' 

and the result follows. 



In general, I have been unable to solve (10.1.1) explicitely. But a 
very partial solution can be given, suggesting that the general one may 
be quite involved. As customary, we denote by SO(n, M) the special 
orthogonal group, that is the subgroup of SL(n, M) of all matrices X 
such that X'^X = Id. 



10.1.2. LEMMA. (t) If a = 2, thenVA, = SO(n,R). 
(ii) If n = 2 and a < 2, then 
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(Hi) If n = 2 and a> 2, then 

Proof. (i) for a = 2, equation (10.1.1) becomes 

Hence, = x'^/X and Id = x~^x = x^x/X. Moreover, deta; = 1 
thanks to Lemma 10.1.1. 

When n is odd, we deduce that 1 = dot Id = A^". Consequently, 
A = 1 and x~^ = x^ . This proves C SO(n, M). Since / is 
invariant under the action of SO(n, M), we have Va^ = SO(n, M) for 
n odd. 

When n is even, we can also have A = —1. But this implies 
—Id = x'^x. Since x^x is symmetric nonnegative, this is impossible 
for real matrices. 



(ii)-(iii): Set x = 




. Using equation (10.1.1), we rewrite the 



equality x x = Id as 

A = sign(a)|a|"-ia + sign(fe)|fe|"-i6 
= sign(a)|a|°-ic + sign(6)|6|"-id 
= sign(c)|c|"~ia + sign{d)\d\''-^ b 
A = sign(c)|c|"-ic + sign(d)|d|"-id 

Multiplying the second equality by ab, the third by cd and subtracting 
yields 

(|a|" - |d|")6c+ (|fe|" - |c|")ad = 0. 
But 1 = det X = ad — be implies then 

(iar - \d\" + \b\" - \c\")bc + |6r - |cr = o . 

At this stage, the first and last equations in (10.1.2) yields = |c|", 
i.e., |6| = |c|, which then implies \a\ = \d\. Set c = eib and d = e2a for 
61,62 in {-1,1}. 

If bd is nonzero, the second equation in (10.1.2) gives, after multi- 
plication by \bd\, 

sign(a)|d|"eisign(6)|6|2 + sign(6)|6|"sign(d)|d|2 = 0. 



\a\" + \b\" 



|c|" + 



(10.1.2) 
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Thus, sign(a)ei = — sign((i) and \d\ = \b\ = \a\ = \c\. Hence, the 
matrix is of the form 

^^'"'(e2 with £1,62, €3,64 G {-1,1}. 

The condition 1 = detx = a^(eie4 — £263) forces €164 — 6263 to be 
positive. Consequently, eie4 = +1, and 1 = 2a , i.e., a = 
Thus, 

solves (10.1.2). 

Next, if bd = 0, let us assume that, say, d = 0. Equation (10.1.2) 
reads 

A = |a|" + |6|" 
= sign(a)|a|°^-^c 
A = |c|". 

Since x is of determinant 1, the matrix x is not zero. Therefore, the 
relation 2A = |a|" + |?)|" + |c|° + > forces c^O. Hence, a = 
and |6|" = A. The matrix is of the form 



X2 = \c\ 



62 
ei 



The condition deta:2 = 1 forces |c| = 1 and ei = — e2G{— 1,1}. 

e 



Finally, if 6 = 0, similar arguments yields a solution 0:3 = 

with e in { —1, 1 }. 
We then have 



e 



J2 l(^ik)«r = 2, k = 2,3. 

If a < 2, then 2 < 22-(°/2), while if a > 2, we have the reverse 
inequality 2 > 2'^~^°'/'^h The result follows. ■ 

In general, I conjecture the following. 



10.1.3. CONJECTURE. If a ^ 2, then inf{/(x) : detx = l] is 
achieved at a finite number of matrices. Moreover, the difference of the 
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two fundamental forms of A/(sL(n,R)) o.'^d SL(n, M) at those matrices 
is positive. Finally, if a < 2, these matrices have unique nonzero 
elements on each row and each column, whose absolute value is 1; 
hence, up to signs, they are pervfiutation matrices. 

Some numerical computations support Conjecture 10.1.3. Also, 
permutation matrices satisfy equation (10.1.1). 

We can now state our approximation of the tail probability for 
detX. 

10.1.4. THEOREM. Let X = {Xij)^^-^^^ be a random matrix 
with independent and identically distributed coefficients, each having 
a symmetric Weibull like density ■ 
(i) If a = 2 then, as t tends to infinity, 

„(n-l)(n+2)/4 

P{detX^t}^ !L^_^^Vol(SO(n,M))e-"*''"/¥"'— . 
(27r)'* /^vn 

(a) If a> 2, under conjecture 10.1.3, 

P{detX ^ t} ~ cie-^(^i)*'^'"t(-("'-i)-2-')/2- ast^oo, 
where ci > is a constant. 

REMARK. The constant ci in (ii) can be numerically computed 
as will be clear from the proof and Lemma 10.1.1. The volume of 
SO(n, M) in (i) is the volume when SO(n, M) is viewed as a submanifold 
ofM" . This volume is the n(n—l)/2-dimensionalHausdorff-Lebesgue 
measure of the special orthogonal group. 

When plugging a = 2 in the exponent of t in (ii), we obtain — 1/n, 
which is clearly different from the exponent of t in (i). Hence, the 
exponent of t has a discontinuity at i = 2. 

When a = 2 and n = 1, then deta; = x\ the formula reads 

P{X^t} ^e-*'/2r^ ast^oo, 

\/2^ 

a well known fact! SO(l,i?) = {1}, and its volume measure is 
obtained by putting a Dirac mass at 1. 



Proof of Theorem 10.1.4. The result is an application of Theorem 
7.1. 
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Let us first determine the exponential term of the asymptotic 
equivalent. Since At = t^^^Ai and / is a-homogeneous, this term 
is ^-^"^"li^i) _ The calculation of I{Ai) relies upon Lemma 10.1.2 
and Conjecture 10.1.3. For a = 2, Lemma 10.1.2 implies that 
I{Ai) = /(Id) = n/2. For n = 2, Lemma 10.1.2 gives I{Ai) = 2/a if 
a < 2, while I{Ai) = 4/(a2"/2) = 22-"/2/a if a > 2. 

In general, I{Ai) can be computed numerically. 

To obtain the polynomial term in t in the asymptotic expansion, 
Theorem 7.1 requires us to calculate k = dimP^i- When a = 2, we 
obtain k = dimSO(n, M) = n(n — l)/2. For a 7^ 2, we have k = 
since P^i is discrete — here we use Conjecture 10.1.3 when n > 2. 

It remains to evaluate the constant ci in Theorem 7. 1 and to verify 
the assumptions of Theorem 7.1. Since the differential geometries 
of T>Ai and dAi are involved as well as that of A;^^^), wc need to 
calculate the differential and Hessian of det and /. Lemma 10.0.1 
takes care of the former. When dealing with matrices it is convenient 
to express D/ and D^/ on the orthonormal basis E^'^. Thinking 

■ ■ 2 

of E^'^ as an element in the dual of M" = M(n,R), we have 
E''^{M) = {E''^,M) = Mi J for any matrix M = (Mij) in M(n,M). 
With this notation, the gradient and Hessian of / have the following 
form. 

10.1.5. LEMMA. For any a ^ 1 and any nx n real matrix x, 

D/(^) = Ei^i,,-^nSign(a:,,,)|a:,jr-iE^.^- . 
Moreover, i/ a ^ 2, 

D2/(x) = (a - 1) Ei^ij^n Hj^-^E'^' ® E''' e M(n^ R) . 

Proof. Viewing M(n, M) as M** , we have for every x, h in M(n, M), 

I{x + eh) = I{x) + e sign(a;jj)|a;ij|°~^/iij 

+ ^ 5^ (a-l)K,r%. + 0(e2) 

as e tends to 0. Since hij — {E^'^ , h), we obtain the expression for D7. 
Using the polarization formula to express D'^I{x){h, k), we see that 



(Kj + Ki? - hlj - kfj = 2hijki,j = 2E''^ ^ E'^^{h, k) 
This gives the expression for TP'I{x). 
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We are equipped to determine the tangent spaces to T>a^ and 
^i{Ai)! from which we wiU deduce deiGAi- 

10.1.6. LEMMA. For any x in Va^, 

T^^l{A,) = {h£ M(n, M) : (x-^^, h) = Q] 

= { x'^"^ }^ = {xh:h£ M(n, M) , tr /i = } ; 
TxVa-^ = i/a 7^ 2 (under conjecture 10.1.3 for n^2).\ 
TxT^Ai = {xh : h skewsymmetric } if a = 2 . 

Consequently, at any x ofVAi n A/(^^), 

TxJ^i(A,)QTr,T>A, = I {xh: he M(n,R) , h symmetric , ti{h) = 0} 
^ ' ' if a = 2. 



Proof. Since A/(^^) is a level set of /, we have for all x in Va^ , 

TxAiiA,) = {mx)}^ = {x-'^}^ 
thanks to Lemmas 10.1.5 and 10.1.1. Since 

{ x-^'^ }^={h: (.T-iT, h)=0}={h: tr(x-^/i) = } 
= {xh: tr(/i) =0} 

the expressions for TxAj(^Ai) follow. 

When a is different than 2, Conjecture 10.1.3 asserts that Vai is a 
finite set, and indeed T^Va^ = 0- 

For a equal to 2, the dominating manifold is SO(n,R). The Lie 
algebra of SO(n, M) is the set of all skewsymmetric matrices — see, 
e.g., Knapp, 1996, §1.1 — and the expression for T^Va^ follows in this 
case. 

The result on T^Ak^Ai) © T^Vai is then clear since the skewsym- 
metric matrices are orthogonal to the symmetric ones. ■ 

In order to describe the matrix Gai involved in Theorem 7.1, recall 
that the ^p-norm of a vector a; G = M(n, R) is 
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10.1.7. LEMMA. For x in Va^, the matrix Gai{x), is obtained in 
restricting the bilinear form 

to the subspace T^Ak^Ai) © T^Vai- 

Proof. Given the comment following Theorem 7.1, it is enough 
to calculate the second fundamental form of the hypersurface Aj(^^) 
(resp. dAi). Since this hypersurface is the level set of the function / 
(resp. dct), its second fundamental form is the restriction of D^//|D/| 
(resp. D^det/|Ddet|) to the tangent space of A/(^j) (resp. dAi). 
Lemma 10.1.5 gives 

B^Iix) _ {a-l) Ei^^,,<n \x^X-'E^'' ® E^'' 

|D/(x)| (y- l^. .|2(a-l)^V2 

= T1^ E Kjr'E^'^^E^'^. 

I^l2(a-l) l^i,j^n 

To calculate the second fundamental form of dAi at a point x 
in "Dai, notice that for h tangent to dAi, Lemma 10.0.1 implies 
tr(x~^/i) = 0. Consequently, for h,k in T^dAi, Lemma 10.0.1 yields 
B^det{x){h,k) = -tT{x-'^hx-'^k). Then, Lemma 10.0.2 and (10.0.1) 
show that for x G I'ah the matrix D^det(a;)/|Ddet(a;)| is 

- E {x-')m4^-')j,lE''^ ® E^'"^/\{x-Y\ 

li^ij.l.mi^n 

and the result follows. ■ 

10.1.8. LEMMA. If a = 2 and X belongs to Vai = SO(n,M), then 

2 

GAi{x) = ^Id]g(„-l)(„+2)/2 . 

Y ri 



Proof. For a = 2 the differential of / is the identity. If x is 
in "Dai, Lemma 10.1.2 forces |D/(x)| = |x| = y/n. Furthermore, 
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since T>Ai is the special orthogonal group, Ddet(.T) = (x^^)'^ = x on 
Vai = SO(n,]R). Consequently, |Ddet(a;)| = ^/n on Vai- 

If h, k are in T^Aj^Ai) © T^Va^, and x is in SO(n, M), Lemma 
10.1.6 implies [x'^h)"^ = x~^h since x~^h is symmetric as well as 
tT(x~^h) = 0. We then infer from Lemma 10.0.1 that 

D'^det{x){h,k) = -tr{x-^hx-^k) = -tr{{x-^ h^x'^k) = -ti{h^k) 

= -{h,k). 

Consequently, the restriction of the bilinear form D^det to TxAj(^Ai) © 
TxVai is the identity. When x is in "Dai , it follows that Gai {x){h , k) 
= 2{h,k)/^/n on T^Ajf^Ai) © T^T^Ai- Since T^Aji^Ai) © T^T^Ai has 
dimension (n — l)(n+2)/2 thanks to Lemma 10.1.6, the result follows. 

■ 

In order to obtain an expression for Gai when a 7^ 2, we find an 
exphcit orthonormal basis of T^Aj^^Ai) QTxVai for x in Vai ■ It is then 
possible to express the matrix Gai in this basis. The construction goes 
as follows. 

For a real matrix M = (^i,j)i<jj<„ we denote by M,j = 
{Mi,j)i<:i<n (rssp. Mj^,) the vector in R" made of its j-th column 
(resp. z-th row). 

Notice that any x in "Dai is also in SL(n, M), and so is invert- 
ible. For 1 ^ i,j ^ n, i ^ j, let y*- G M*^ be an orthonormal 
basis of { }-*-, the orthogonal subspace in of the i-th col- 

umn vector of x^^. Define also e = {{x~^'^x~^)i^i) ^^^^^ G R", the 

vector whose coordinates are the diagonal entries of x~^'^x~^. Fur- 
thermore, define yj, 1 ^ i ^ n — 1 to be an orthonormal basis of 
{ e }^ where { e}"*" — in M" — is equipped with the quadratic form 
Proj|g}±diag((x"^'^a;"^)j,i)^^.^^|^g-^j.- This quadratic form is the 
compression to { e }-*- of the diagonal matrix obtained by writing the 
components of e on its diagonal. We denote by 7/^ 1 ^ A; ^ n, the 
components of the vector in R", 1 ^ i, j ^ n. Finally, define 

10.1.9. LEMMA. In M(n,R) = M"', the - 1 vectors F''^ , for 
i,j in { 1, 2, . . . , n } with {i, j) ^ (n, n), form an orthonormal basis of 
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Proof. Let us first show that ah the matrices F^'^ are orthogonal to 
x~^'^. Indeed, if i 7^ j, 

while for i = j, 

= {yle)=0. 

To check that we have an orthonormal basis, we use the identity 

{E''^,E'''^) = If « / J and p ^ q, 

Next, for i ^ j, 

Finally, for i, j = 1, 2, . . . , n — 1, 

= E yHji^-'^^-'ki 
= kj 

by our choice of ■ 

Combining Lemmas 10.1.7 and 10.1.9, we can calculate the (n^ — 

1) X {n? — l)-matrix F*^'')) where {k,l) belong 

to { 1, . . . , n } X { 1, . . . , n } \ { (n, n) }. This amounts to writing the 
matrix Gai{x) in the orthonormal basis F^'^ , 1 ^ i,j ^ n, ^ 
{n, n). The explicit calculation is rather long, and unfortunately does 
not seem to simplify much. But the work done so far is all that we 
need to implement the approximation numerically. 

To conclude the proof of Theorem 10.1.1, it remains to check the 
assumptions of Theorem 7.1. 



210 



Chapter 10. Random matrices 



Assumptions (7.1) and (7.2) hold since we assume a > 1. Assump- 
tion (7.3) is trivial since we assume a ^ 2. 

Assumption (7.4) is guaranteed by Conjecture 10.1.3 when a 7^ 2, 
while it is trivial for a = 2. 

To check assumption (7.5), use Remark 7.3 and the calculation of 
Ddet and D/ made in this section. Indeed, Lemma 10.1.1 yields 

(D/(x),Ddet(x)) = i|x§;^:;j>0 

for any x in Va^- This concludes the proof of Theorem 10.1.4. ■ 

When a = 2, we infer the following corollary. 

10.1.10. COROLLARY. Let (Xij)^^- be an nxn random matrix, 
with independent and identically coeffieients all having a standard 
normal distribution. The distribution of t^^^'^X given detX ^ t 
converges weakly* to the uniform distribution over SO(n, M). 

Proof. It is now a straightforward application of Theorem 7.5. ■ 

10.2. Random determinants, heavy tails. 

Wc now consider the problem of approximating the tail probability 
of the determinant of a random matrix, assuming that its coefficients 
are independent and all have a Student like cumulative distribution 
function Sa- For this problem, the general framework proposed so 
far can be used. The argument is very much like that used to prove 
Theorem 8.3.1. For a change, we will give a probabilistic proof, which 
is actually inspired by Theorem 5.1, showing another sort of use of 
that theorem. This proof will be far less conceptual, and will give no 
insights. 

The result is as follows. 



10.2.1. THEOREM. Let X = {Xij)i<^ij^n be a matrix with inde- 
pendent and identically distributed coefficients, all having a Student 
like distribution with parameter a. Then, 

r.r . . n(2X3„a^"+i)/2)" (logt)"-i 

P{detX^t} ^ — as t^ 00. 
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Proof. We will see why Theorem 5.1 suggests that, as t tends to 
infinity, 

p{ detx ^t}=p{ j2 n ^ * } 

= n\P{ n X,,i^t}. 



Admitting this relation, our first lemma gives the key estimate. 

10.2.2. LEMMA. Let Xi, . . . , he n independent random variables 
with Student-like distribution Sa- The product Xi...Xn has upper 
tail 

nX^...X^;,t}r.^-^-—^^ ^ ast^oo. 

Its lower tail is equivalent to its upper tail. 

Proof. We proceed by induction. For n = 1, the result is plain from 
the definition of Student-like distributions. Call F„ the cumulative 
distribution function of the product Xi . . . X^, and Cn the constant 

_ (2i^,,„a("+i)/2)" 



2a(n-l)! 

Assume that Fn-i has the form given in the statement of the lemma. 
Then 

1 - Fn{t) = P{Xi. ..Xn-1 ^ t/Xn ; X„ > } 

+ P{Xi...Xn-i^t/n; Xn<0}. 

Let us evaluate the first probability in the sum. The second one is 
either evaluated in the same way, or is obtained from the first one 
by changing Xn into — This first probability can be rewritten as 

J^l-Fn-l{t/x)d3c,{x). 

Let 5 be a positive number. Using the induction hypothesis, there 
exists a positive M such that for any y larger than M, 

n (logy)""' ^ -, J, /wn^A^ (^Qg^)""' 

(1 - 5)Cn-l ^ 1 - Fn-l{y) ^ (1 + 6)Cn-i . 

y y 
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Moreover, taking M large enough, we also have 

y y 

Consequently, 

rt/M j-t/M noet/x)"-2 

/ l-F^_,{t/x)dS^{x)^{l + 5)cn-i / ^-^-L±—dS^{x). 

(10.2.1) 

We integrate by parts, writing 

/ x''{logt/xr-^dS^ix) = x"(logi/x)"-2(S,,(x) - 1) 

pt/M 

- / (ax"-i(logt/xr-2-(n-2)x"-\logt/x)"-3)(S'«(x)-l)dx. 

The number M can be taken large enough so that 1/logz ^ 6 for any 
z greater than M. Then, 

ft/M 

/ x''{logt/xr-^dS^{x) 

rt/M I , J , n—2 

^0(logi)""^ + (a(l + 5)ci + 5(n-2)) / - log- da; 

J M X \ X/ 

ft/M 1 

= 0(logt)"-2+ (aci(l + (5) + (5(n-2)) / -(logyf-^dy 

Jm y 

= 0(logt)"-2 + (aci(l + 5) + (5(n - 2)) M*)^ ^ ^^^^^ 

n — 1 

as t tends to infinity. Therefore, (10.3.1) yields 

ft/M 

/ l-Fn-l{t/x)dSa{x) 

Jm 

^ ^-^cn-i (a(l + S)c, + S{n - 2)) ^i^^O^ (i + . 
r" n — 1 

We obtain a similar lower bound, replacing 6 by —5. 
In the range of integration x > t/M, we have 

/ l-Fn-lit/x)dSa{x)^l-Sait/M) = 0{t-'^) OS t ^ OO . 

J t/M 
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On the other hand, when x < M we have 

J^"" (1 - i^n-l(t/x)) dSaix) ^ 1 - F„_i(t/M) = 0(^i5^^) 

as t tends to infinity. Since S is arbitrary, we proved that 

PS Y Y ->i/Y ■ Y ->n\ „ CiCn_i (logt)""^ 

F{Xi... Xn-i ^ VA„ A„ ^ 1 ~ a — , 

as t tends to infinity. Therefore, 

CiCn-l (logt)"-l 



1 - Fn(t) ~ 2a- 



n-1 



Since 2aciCn-i = Cn, the result on the upper tail follows. The lower 
tail F{t) as t tends to infinity is handled in the same way. ■ 

The next lemma will allow us to prove that if detX ^ t and t is 
large, it is very unlikely that two different products Hi^i^n "^i,c^(«)' 
a G &n, are both of order t. 

10.2.3. LEMMA. Let Xi,. . . ^X-^+k be n + k independent random 
variables with Student-like distribution function Sa- For a > 1, and 
k positive, 

P{Xi. . .Xn ^t; Xk+i . . . Xk+n ^t} = o(^ ^ ^ 

as t tends to infinity. 

Proof. If k ^ n, the result follows from independence and Lemma 
10.2.2. Thus, from now on, assume that k < n. Set Y = Xi . . .X^, 
Z — XkJ^x . . .Xn and U = Since these random 

variables are independent, we have 

P{YZ^t-ZU ^t}= [ P\ Z ^t(-V-)}dFkiy)dFk{u) 

+ / p\z^t(-A-)} dFkiy)dFk{u) . 

Let us evaluate the first integral, the second one being similar. Using 
the symmetry in u and y, it suffices to prove that 

/ P{X^ t/y}dFk{y)dFk{u) = o(r"(logt)--i) 
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as t tends to infinity. 

Let us use the notation Ck as in the proof of Lemma 10.2.2. Let 
5 be an arbitrary positive number. Then, there exists a positive M 
such that for any u> M 

(1 - i)c„-t "°'^ « F{ Z S u } < (1 + .i)c„-t "°'^"^°"*"' 

(1 - *)c»fi^ « 1 - F.(„) « (1 + . 

We then have, using Lemma 10.2.2 and the fact that k is strictly less 
than n 

t/M<u 

^ / dFfc(«) = 0(t-(logi)*^+i) 

Jt<M<u 

= o(r"(iogtr-i)). 

Thus, we need to prove that 

/ P{Z^ t/y}dFfc(y)dFfc(^x) = o(r"(logO"-') 

J M<y<u<tlM 

(10.2.2) 

as t tends to infinity. We first perform the integration in it, obtaining 



J M<y<u<t/M 

= I P{Z^t/y}{l-Fk{y)-l + Fu{tlM))dFk{y) 

JM<y<t/M 

^ (1 - Fk{M)) [ P{Z^ t/y}dFk{y) . 

J M<y<t/M 

3 then use the bound on the tail of Z and integrate by parts, 
/ P{Z^t/y}dFk{y) 

J M<y<t/M 

^{l + S)cn-k (log-) l^dFkiy) 



^(l + <5)c„_fc[(log-) ^L.{Fk{y)-l) 



t/M 
M 



+ (l + S)Cn-k ^(log-^ 



(1 + 5)Cn-k f 

Jm 



lM<y<t/M ^ y 

(alog^ + {n-k- 1)) (Ffe(y) - l) dy 
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Using the bound on 1 — Fj^, we obtain 

/ P{Z^tly}AFk{y) 

JM<y<t/M 

^ 0((logi)"-*^-Vi") 

+ 0(1) / (logt/yr-fe-^ i\ogyf-^ 

JMKyKt/M t"' y 

The change of variable v = (log y) / log t shows that 

(logt - logy)-^-^-^ {logyf-^ 



M<y<t/M y 



dy 

Jo 



Consequently, (10.2.2) holds as well as Lemma 10.2.3. ■ 

We can now prove Theorem 10.2.1. For a permutation a in 6„, 
define 

For any fixed positive 5, 

P{ deiX ^t} = P{ ^Y^^t] 

U {{ya^t{l + 5)}r] n {\Yr\^t6/n\})} 



^ 5^ P{y, ^t(l + 5)}n n {\Yr\^t5/n\}} 



- P{Ya,^t{l + S);Y,,^t{l + 6)} 

lTl,0"2eSn 

From Lemma 10.2.3, we infer that 



J2 P{Ya, > t{l + 5) ; y., ^ t(l + ,5) } = ' ) 
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Moreover, if r and a are distinct, Lemma 10.2.3 implies 

P{ ^ t{\ + b) and \Yr\ ^ tjM. } = o(^^^^^^^ . 
Consequently, using Lemma 10.2.2, 

5^ p{F.^t(l + <5)}n n {l^rl^iVn!}} 



o-eSn T€e„\{o-} 
~n!P{yid^t(l + 5)} 

This proves the lower bound 

P{detX ^t}^ -{2K,aa"-i'r ^'''^T~\^^Tl 

as t tends to infinity. 

To obtain a matching upper bound, notice that 

P{detX ^i} ^ P{3ae 6n, Y„ ^ t{l - 6) 

and Vr G 6„ \ {a} , \Y„\ } 
+P{ 3 n, T2 G 6„ , n ^ r2 , ^ (^t/n! ; Y^^ ^ 5t/n\ } 

Applying Lemma 10.2.3 and 10.2.2, we obtain 
P{detX^i}^ ^ P{ ^ t(l + o( ^^"^^y ) 

aG6„ 

(2i^,,^a^)"(logt)"-M + o(l) 

~n 4: ^ — ^ as i — >■ oo . 

2q; i° (1 - (5)" 

Since 5 is arbitrarily small, we proved Theorem 10.2.1. ■ 

Let us now show why Theorem 5.1 suggested the proof of Theorem 
10.2.1. Define the set 

At = {x& M(n,M) : deta; ^t} = t^l'^Ai . 

Theorem 10.2.1 provides an estimate for the integral 
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The change of variable Y = Sa{X) leads us to introduce 
It allows us to rewrite the integral under consideration as 



where 



Bt 



^(2/) = ^ + log(2vr) 



is convex. To minimize / over Bt, take a matrix x in dAi that is 
in SL(n, M). Then y = $^0 Sa{t^/^x) is on the boundary of Bt. 
Furthermore, 

Thus, for /($^o Sci{t^^'^x)) to be minimum asymptotically, x should 
have as many zero components as possible, namely n. The matrices 
of SL(n, M) with n nonvanishing entries form a subgroup which can 
be described as follows. Define the matrix 



-^"1,71-1 = 



-1 
Id„_i 



Let DSL(n, M) be the subgroup of all diagonal matrices in SL(n, M). 
To a permutation a in 6„ we associate the matrix of its permutation 
representation, conveniently denoted a as well. Thus, aci = e^^^y 
Denote by ©n,+ the subgroup of all even permutation of n elements. 
Equivalently, &n,+ is ©^(^^^(n, M). Denote ©n,- the subset of ©„ 
of all odd permutation matrices. Let (/i,n-i©ri,-) be the subgroup 
of SL(n, M) generated by the matrices with a G ©n,-- Then 

&n,+ U (/i,n-i©n,-) is a group made of matrices which are, up to the 
sign of their entries, permutation matrices, and are of determinant 
equal to 1. This group acts on DSL(n, M) by 

(a,m) G (©n,+ U (/i,„_i©n,-)) X DSL(n,M) ^ am G SL(n,M) 

Denote by (©«,+ U (/i^„_i©„^_))DSL(n, R) the image of this action. 
One easily sees that it is a subgroup of SL(n, M), made of all the 
matrices with exactly n nonvanishing entries. Let x = am be 
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in this subgroup. Using Lemma A. 1.5 and the fact that detm = 

= (alog(tV"|rn,,,|) -iloglog(tV-|^^_.|) 

- 21og(K.,«a"/220F)) + log(2^)"'/2 + 0(1) 
= alogt - ^loglogtV'i _ 2dlog(i^,,aa"/'20F) 
+ log(27rr'/2 + o(l). (10.2.3) 

This expression does not depends on m. It suggests that the domi- 
nating manifold in our problem should be 

$^o5a(t^/"(6„,+ U (/i,„_i6n,-))DSL(n,M)) . 

This set is made up of n! connected components, each component 
being DSL(n, M) composed on the left either by an even permutation, 
or by Ii^n-i E^nd an odd permutation. As / is invariant under 
permutations and composition by /i,n-i) all these components should 
be equally likely. Since the distribution of the Xj's is asymptotically 
symmetric, this suggests the approximation 

P{ E «(^) n E ^{ n ^^,aori^)^t}, 

where r is a transposition, depending on a, such that 6^ = { cr, (tot : 
a G &n,+ }- The main reason the proof is compHcated using this 
method is that (10.2.3) is not uniform in m. It is uniform in the range 
fi/n^ ^ oo and log |mj^j|/ logt ^ 0. This is exactly the same problem 
as the one we faced in section 8.3, and a similar parameterization can 
be used. 

10.3. Geometry of the unit ball of M(n,E). 

The purpose of this section is to study some elementary differential 
geometric properties of the set S of all real matrices of norm 1. This 
will be instrumental in the next section to obtain results on norm of 
random matrices. We will prove — Propositions 10.3.1 and 10.3.2 — 
that this set is a fiber bundle over a Klein bottle of dimension 2(n — 1), 
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whose fibers are isomorphic to the unit ball of (n — 1) x (n — 1) real 
matrices. We will explicitly calculate various curvatures of this set. 

Recall that the set M(n, M) = of all n x n matrices with real 
entries is equipped with the inner product 

{M,N)= J2 MijNij=tv{MN^). 
With this inner product, M(n, M) is R" equipped with its standard 

2 

inner product. On R" , the Euclidean unit sphere is a submanifold of 

dimension n? — 1 with constant curvature, whose geometry is very 
well understood. However, for algebraic reasons, it is often more 
convenient to equip M(n,M) with the operator norm 

||M|| = sup{ \Mu\ : u G Sn-i } , 

where | ■ | is the Euclidean norm in R" and 

Sn^i = { X G R" : |a;| = 1 } 

is the unit sphere centered at the origin. The unit sphere centered in 
(M(n,R), II • II), namely 

5 = {M e M(n,R) : ||M|| = 1}. 

is not as familiar as 5^2 _i as far as its geometry is concerned. We 

2 

need to understand what S looks like in M(n, R) identified with R" . 
For this purpose, for any u,v ^ R", define the subspace of matrices 

Hu,v = {he M(n, R) : hu = h^v = 0} . 

In what follows, vectors in R" are considered as row vectors, and so if 
u belongs to R", then it'^ is a 1 x n matrix. We also use systematically 

the tensor product notation; if u, v are two vectors in R"^, their tensor 
product is the matrix u (E> v — ini^ . This notation agrees with that 
used in section 10.1 when we dealt with vectors in M"". 

To understand the geometry of S, it is convenient to remove some 
singular points and define 

= {M ^ S : lisa simple eigenvalue of M'^M } . 

In M(n, R), the closure of S° is S. Proposition 10.3.1 bellow asserts 
that S° is a smooth submanifold of R" . Moreover, is a fiber bundle 
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over a Klein bottle Sn-i Sn-i ^ 5'„_i x Sn-i/{ld,—ld}, whose 
fibers are isomorphic to the unit ball of -ffei,ei for the operator norm. 
So the dimension of the fibers is (n — 1)^. We will show that there are 
no higher dimensional convex subsets in S° — this follows from the 
form of the curvature tensor of <S given in Theorem 10.3.3. Each fiber 
is also is orthogonal to its base point in Sn-i x Sn-i/{ Id, — Id }. 

10.3.1. PROPOSITION Every matrix M in S can be written as 
M = u ® V + h for some u, v m Sn~i, and h G H^^v with \\h\\ ^ 1. 
This decom,position satisfies the following properties: 
(i) up to the transformation {u,v) i— > (—u,—v), it is unique if and 
only if 1 is a simple eigenvalue of M'^M. 

(a) Hu,v is orthogonal to u®v and dim Hu,v = {n— 1)^ for all u, v in 

Sn-l- 

Proof. To check that matrices of the form M = u®v+h, with u, v in 
Sn^i and \\h\\ ^ 1 arc of unit norm, notice that the operator norm of 
such matrix is at least 1, since Mn = v. On the other hand, write any 
vector X of as u{x,u) + Proj^xx where Proj„x is the projection 
onto {u}-^. Then, apply M to a;, use that /i is a contraction and 
belongs to Hu,v to obtain |Ma;p ^ |a;p, and so ||M|| ^ 1. 

To prove that all matrices of norm 1 arc of this form, take u to be a 
unit eigenvector of M'^M with eigenvalue 1. This vector u is unique 
up to its sign if and only if 1 is a simple eigenvalue. Define v = Mu 
and h = M — wS) V. Since 

l = \u\ = \M'^Mu\ ^ \Mu\ = \v\ ^ = 1 , 

the vector v also belong to Sn~i- One easily checks that h belongs 
to Hu,v To see why /i is a contraction, notice first that hu = 0. 
Moreover, if w is orthogonal to u, then \hw\ = \Mw\ ^ \w\. The 
uniqueness statement is then clear. 

The orthogonality relation (ii) follows from {u 'S)v,h) = tT[vu^h"^) 
= 0, for /i belongs to H^^^. 

To obtain the dimension of Hu^v^ write Ru as an orthogonal matrix 
mapping the first vector of the canonical basis of M", say ei, to u. 
Then Hu,v = Hji^ei.ei^u- Hence, dimHu^y = dimi^g^^gj. Since the 
equations determining i/ei,ei are 

hi,i = /ii,2 = . . . = /ii,n = and = /i2,i = . . . = hn,i = , 



we have dimi^„^^ = (n — 1)^ as claimed. 
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For the unit sphere Sn-i in M", it is an obvious fact that the tangent 
space at any point u is just the subspace orthogonal to u in R"^. So 
one may wonder if this property has an analogue for the unit ball S. 
Our next proposition shows that this is somewhat the case and gives 
an explicit description of the tangent spaces. This will be useful in 
calculating the curvature tensor of S°. It also proves that the fibers 
Hu,v are not only orthogonal to u (Si v but also to the tangent space 
Tu^y{Sn-i <8) •S'n-i). Hence, they point orthogonally to the base. 

Notice that if h belongs to Hu^v, the image Imh = /iR" is included 
in { u }-*- = TjjSn-i, while Imh"^ C { ^^ }"*" = T^Sn-i- For u, v in Sn-i 
and h in consider the following subspaces of M(n, M), 

Hl,v,h = {a®v-u® (ha) : a G r„5„_i } , 
Hlv,h = {u(Sb- {h^b) Sv : be T^Sn-i } • 

10.3.2. PROPOSITION. Let u, v be in Sn-i and h be in Hu,v with 
\\h\\ < 1. Then 

(n) T„^,+,50 = © (Hl^^, + Hl^^,) . 

The vector uS v is an outward unit normal to at all points of the 
form u ® V + h, with u,v G Sn-i and h a contraction belonging to 

Proof. It is convenient to notice the following trivial identity which 
will be used repeatedly: for any a, 6, x, y in R", 

{a® b,x ® y) = ix{ba^ xy"^) = (a, x) {b, y) . 

(i) Let a be in T^Sn-i, and 6 be in T^^Sn-i- Define 

x = a(Siv — u(S {ha) G , 
y = u^b- (h^b) ^ve Hl^^^ . 

Since u is orthogonal to a and v to 6, and h is in Hu^y, 

\{x,y)\ = \{ha,b) + (a, h"^b)\ ^ \ha\\b\ + \a\\h^b\ . 

Moreover, for the same reasons, 

|a;|2 = |a|2 + |/ia|2 , and \y\^ = \b\^ + \h^ bf . 



222 



Chapter 10. Random matrices 



Therefore \{x,y)\ < \x\\y\. Thus, x and y cannot be coUinear, and 

^l,v,h^^u,v,h = {0}- 

(ii) The inclusion of H^^v into Tu(g,v+hS^ is clear: consider the tangent 
vector at of the curve s<-^u'S)v + {l + s)h G <S°. 

Next, consider two curves u{s), v{s) in Sn-i, with u{0) = u, 
v{0) = V, u'{0) = a, v'{0) = b. Let h{s) be a curve in M(n,M) 
such that h{s) is in Hu(s),v{s)i h{0) = h, and h'{0) = k. The tangent 
vector at of the curve u{s) v{s) + h{s) in is a v + u b + k. 
Differentiating the relation h{s)u{s) = h'^{s)v{s) = at s = yields 

ku = -ha and k^v = -h^b. (10.3.1) 

Taking 6 = 0, one sees that k = —u (g) {ha) satisfies (10.3.1) and so 
a (g) v — u (g) {ha) is in the tangent space Tu^^+hS'^. Hence, H^^j^is a 
subset of Tu^^+hS^. 

Considering a = and checking that k = {—h^b) ® v satisfies 
(10.3.1) yields the inclusion of i/^ ^ ^ in Tut^^+hS^- 

The orthogonality of i?^ ^ ^ and H^^y comes from the fact that for 
a in TuSji-i and h in Hu,v, 

(a (8) u — It (8) {ha), h) = iT{av^h — u{ha)'^h) = . 

Similarly, one proves that t, ^ is orthogonal to Hu,v ■ 

As a consequence of Proposition 10.3.1, is a manifold of dimen- 
sion — 1 and dimiJ^^^ = (n — 1)^. Thus, 

dim {H^,, e {Hl^^f^ + Hl^^^)) = dim5° 

and we indeed found the whole tangent space to (S° — and not only 
a subspace. ■ 

We can now construct explicitly an orthonormal basis for the 
tangent space in which we will express the second fundamental form 
of the immersion <S° C M" , and hence the curvature tensor of »S°. 

For this purpose, we denote by e^, • • • , e'^-i an orthonormal basis 
of TuSn-i- Whenever h belongs to Hu,v, the vector u is in the kernel 
of h^h. Thus, if \\h\\ < 1, the matrix (Id + h^h)-'^/^ is well defined, 
and { u }-*- is an invariant subspace for this matrix. Consequently, the 
vectors 



ai = (Id + /i'^/i)-^/^e^ G T^S^-i , 1 ^ i ^ n - 1 , 
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are all in TuSn-i- They even span T^Sn-i, because so do the e"'s and 
\\h\\ < 1. The matrices 

fi = ai® V — u® {htti) , 1 ^ i ^ n — 1 , 

form an orthonormal basis of ^ ^ since an elementary calculation 
shows 

(/i, fj) = (aj> (M + h^h)aj) = 5ij . 
To construct an orthonormal basis of the orthocomplement 

notice that for h in H^^v with \\h\\ < 1, the subspace { }-*- is invariant 
under (Id + hh^f/'^ and (Id - hh^)-^. Thus, 

hj = (Id - hh^)-^{ld + hh^y/'^e] G nSn-i ■ 

For any b in TySn-i, define 

V' = h-2 {b,hai)hai e {v}^ , 

l^i^n— 1 

b'' = h^b-2 {b,hai)ai e {u}^ . 

l^i^n— 1 

The vectors b^ = {bj)^ and bj = (6j)*' are then defined, and so are the 
matrices 

gj=u® bj -b'^i^v e M(n, R) . 

Using the bilinearity of the tensor product, we deduce that gj belongs 
to K,v,h + K,v,h since u ^ bj - {h%) (g) is in Hl^ ,^ while 
ai<Siv-u<Si (hai) is in H^.^^^. 

10.3.3. PROPOSITION. The matrices fi, gj, 1 ^ i, j ^ n - 1, form 
an orthonormal basis of ^ + ^ j^. 

Proof. It remains for us to prove that the gj 's are orthonormal, and 
that they are orthogonal to the /j's. Since 6" is orthogonal to u and 
bl to V, 

{gj,9i.) = {blbl) + {blbl). 
Using the expression of 6", 6^, 6J and 6^, we write 

(^jiS'fc) = {bj,Qbk) , 
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where Q is the matrix 

Q = hh^ + ldL + A ^ haiajh^{{ld + h^h)ai,ai) 

— 8 haiajh^ . 

l^i^n— 1 

Since ((Id + h^h)ai,ai) = Si^i, we have 

Q = hh^ + ld-4 haiajh^ . 

l^i^n— 1 

Notice that 

J2 a,aj= {U + h^h)-'/^efef {Id + h^h)-'/^ 

l^i^n— 1 IsCj^n— 1 

= (Id + /i^/i)"^/¥roj„x(Id + h^h)-^/^ . 
Since the image of /i^ is orthogonal to u and { 'U }"*" is invariant under 

(Id + /lT/i)-V2^ 

we obtain 

g = + Id - 4/i(Id + h^h)-^h^ . 
This expression simplifies further since 
(Id - hh^){ld + hh^)-^{ld - hh^) 

= (Id - hh')(j2^-lf{hh'f - Yi-^tihh'f^^) 



k+2 

k>Q k>Q 



= Id + + 4 J](-1)'=(M'^)'= 

= Id + hh^ - 4h{ld + h^h)-^h^ 
= Q 

Consequently, replacing bj by its definition, 

{gj,gk) = {{ld-hh^)-\ld + hh^)'/^e], 

Q{ld-hh^)-\ld + hh^)^/hl) 
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To conclude the proof, we calculate 

(/ii 9j) = {ai <Si V — u <Si {hai) , u^b'j - bj (g) v) 
= -{ai,b]) - {hai,b^j) . 

Since 

b'^ + h^b] = 2h^bj - 2 {bj, hak) (Id + h^h)ak , 

we deduce that 

{fh 9j) = -2(ai, h^bj) + 2 ^ {bj, hak)5i^k = 

as claimed. ■ 

Consider an orthonormal basis /ife, 1 ^ A; ^ (n — 1)^, of Hu,v 
Furthermore, define the vectors 

Cj = - [Id - 2(Id + h^h)-^] h^bj G TuSn-i . 

Quite remarkably, it is possible to cxplicitely calculate the curvature 
tensor of through its second fundamental form. 

10.3.4. THEOREM. For u, v in Sn-i, for h in Hy,^^ with \\h\\ < 1, 
the second fundamental form, of S at u <Si v + h in the orthogonal 
basis fi,gj,hk, I ^ i, j ^ n — 1, 1 ^ k ^ {n — 1)^, is given by the 
(n^ — 1) X (n^ — 1) matrix 

n—l n—1 v?—2n+l 

n-1 I {ai,aj) {ci,aj) 

n = n-1 I {ai,Cj) {ci,Cj) + {bi,{ld — hh^)bj) 

n2-2n+l \ 



Proof. Let N = u®v be the outward unit normal to S at u®v+h — 
see Proposition 10.3.2. We will denote by V the covariant derivative 
on S^; that is, for a vector field X defined on S^, for u a tangent 
vector vector field and p a point on 5°, 



VuX{p) = Vto]t^S<^T)X{p)-u. 
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The calculations made in the proof of Proposition 10.3.2 show that 
fi is the tangent vector at of a curve s i— > u(s) <Si v + h{s) with 
u{0) = u, h{0) = h and u'ijS) = and h'{0) = —u^hai. Consequently 
V f^N = aiiSiv. Moreover, 

l^i^n— 1 

= u®bj + 2 {hj,hai)ai® V . 

l^i^n— 1 

Moreover, since A'' is constant along h G Hu,v i-^n<8)t' + /i, ||/i||<l, 
we have ^h^^ = 0. A routine calculation gives the first entries of the 
matrix, namely 

Next, we have 

(Vgi-/V, fj) = -{h^bi, Qj) +2 ^ {h^bi, a^) {a^, aj) 

l<A;<n-l 

= {aj, -h^bi + 2 ^ akolh^bi) . 

l^fc^n-l 

Since the image of is orthogonal to u, 

aualh^ = (Id + hh^Y^h^ . 

This gives the entries in {ai,Cj). 
Finally, wc calculate 

(Vp,iV, Qj) = {u®bi + 2 ^ {bi, hak)ak ® v , u®b] - h^^ ® v) 

= {h,b])-2 {biMk){akXj) 

= {bi,Qbj) , 
where the matrix Q is 

Q = Id — 4 Y^ haiajh^ + 4 hakaj hT' {a^, ai) . 

i^;^n-i i^fc,;^n-i 

Again, since the image of Ji^ is orthogonal to u, 

/i ^ aiajh^ = /i(Id + hh^Y^h^ . 
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Moreover, 

h ^ akaj{ak,ai)h? 



= h{ld + hh^)-^/^ ekej{ek,{ld + h^h)-'^ei) 

= h{ld + hh^y^h^ . 
Consequently, 

g = Id - 4/i(Id + + 4/i(Id + hh^r^h^ 

= Id - + /i(ld - 2(Id + hh^y^yh^ . 

This gives 

(Vg^AT, Qj) = {hi, (Id - hh^)bj) + (ci, cj) 

as claimed. 



It follows from Theorem 10.3.4 and elementary results on immer- 
sions that the Riemannian curvature tensor R of <S° can be calculated 
expHcitly in the basis fi,gj, hk- It is convenient to define fn-i+i = Qi 
for i = 1, . . . , n — 1 and /2(n-i)+j = /ij for z = 1, . . . , (n — 1)^. If X, Y 
are two elements of Tu(g,v+hS^ , then 

{R{fi, fj)X, Y) = X^{Vf,N{Vf,Nf - Vf.N{Vf,Nf)Y. 

Thus, R{fi, fj) is the compression to the tangent space of the matrix 
V/,iV(V/.iV)T-V/.iV(V/,iV)T. 

As a byproduct of the work done, we can prove that there are no 
fiat and nontrivial convex subsets in besides the unit balls of the 
fibers Hu^v This result will not be used in the sequel, but brings 
more intuition on the shape of the sphere S. It is enough to show 
that the 2(n — 1) x 2(n— 1) upper left corner submatrix of 11 has no 
zero eigenvalue. Since \\h\\ < 1 on our parameterization of (S°, this 
follows from the next result. 

10.3.5. PROPOSITION. The following equality holds, 

det I j 

V (f^i' {ci, Cj)ij + {bi, (Id - hh^)bj)ij J 



= det(Id + hh^) det(Id + h^h) det(Id - hh^) 
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Proof. We first calculate a subdeterminant of the given one. Going 
back to the definition of the Oj's and using that u is an eigenvector of 
(Id + /i^/i) associated to the eigenvalue 1, 

det((ai, aj))ij = det{ef, (Id + h^h)e]))^^. .^^_^ 
= det(Id + h^h) . 

Furthermore, since v is an eigenvalue of (Id + K^hY^"^ and (Id — 
hJi^)"^ associated with the eigenvalue 1 — this can be seen by series 
expanding and using the fact that K^v = — 

det((6„(Id-MT)6,))^^^. 

= det((e^, (Id + hh^f/'^{ld - hh^)-^{ld + hh^)^/'^e])i,i) 
= det(Id + hh^) det(Id - hh^) . 

To conclude the proof, we use the following claim, with di = (Id — 

Claim. Let Ui, Ci, di, 1 ^ i ^ n — 1, be 3(n — 1) vectors in M". 

Consider the nx(^n — l) -matrices a — (04, . . . , a^-i), c = (ci, . . . , c„_i) 
and d = (di, . . . , dn-i)- If the image of c is contained in the image of 
a, then 

det ( [ ] ( a c)+(l d^d))= • 



To prove the claim, let P be an orthogonal matrix and D he a 
diagonal one such that cFd = PDP^ . Writing M for the matrix 
whose determinant we want to calculate, we have 

de*M = <iH(('„^ e,C^ »)) 

The proof then goes by induction on the dimension of noticing 
that for any real number 5, 
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where Vm,m is the (m — 1) x (m — 1) left upper corner of V and m the 
dimension of V. Consequently, we just need to prove that det V = 0. 
This is clear since the condition Imc C Ima implies that the rank of 
the matrix (a c) is the dimension of the image of a, and hence the 
rank of V is at most dim Ima. Consequently, we have 

dei(v+(^'^ ^ = det(a'^a)detL». 

This proves the claim and concludes the proof of Proposition 10.3.5.B 

As a consequence of Proposition 10.3.5, the Gauss-Kronecker cur- 
vature of the nonflat part of 5° at u v + h is given by det (Id + 
/i^/i) det(Id + /i/i^)dct(Id — hh/^). Other curvatures can be calculated 
as well, leading to more or less interesting formulas. 
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Let us again consider a random matrix X — i-^i,j)i<i j<n ^i^^^ 
independent and identically distributed coefficients. Its (operator) 
norm is 

||X|| = sup { \Xu\ :\u\ = l,ueW} . 

In this section, we will obtain estimates for the tail probability 
P{ ||X|| ^ t }, assuming that the Xjj's are either symmetric Weibull 
or Student like distributed. 

In theory, we just need to apply the results of chapter 9. Indeed, 
being reflexive, ||X|| = sup { {Xu,v) : u,v ^ Sn-i}- Since 
{Xu,v)Mn = {X,u (g) 'y)]g„2, we see that ||X|| is the supremum of the 

2 

linear form X acting on the submanifold Sn-i 'S> Sn-i of . 

However, a direct application of the results of chapter 9 in the case 
of light tails is not that easy. We will proceed by using both chapter 
7 and ideas from sections 9.1 and 9.2 as well. 

Our first result is for light tails. 

10.4.1. THEOREM. Let X = {Xij).^^^- be a random matrix 
with independent and identically distributed coefficients, all having the 
symmetric Weibull-like density 
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(i) If a = 2, then 

P{ \\x\\ ^t}^ ^_^p^e-*V2i2(n-i)-i ast^oo. 
Ill II / ; 2"-ir(n/2)2 



(ii) If a > 2, then 



2 



(•27r)(" -l)/222"-l g-t"/(«n"-2) 



yj(2-a)(n2+i)/2^Q, _ 2)"-l(a - l){n-ir/2 (n^ + l)-n2 

as t tends to infinity. 
Proof. Define 

At = {x = G M(n, M) : ||x|| ^t}=tAi. 

We need to evaluate 

Defines tlie a-liomogeneous function 

We can apply Theorem 7.1. The first step is to calculate I{Ai) and 
P^i- To do this, we need a description of the boundary 

dAi = {xe M(n, R) : ll^ll = 1 } , 

that is of the sphere of radius 1 in the space of matrices endowed 
with the operator norm. This is provided by Proposition 10.3.1. Let 
us simply recall here that the matrices of norm 1 coincide with all 
matrices of the form wSiv + h, where u, v belong to the sphere Sn-i 
and h is em n X n matrix of operator norm less than 1, satisfying 
hu = h^v = 0. This allows us to find Vai ■ 

10.4.2. LEMMA. The function I is minimum over dAi exactly at 
matrices of the form u®v with 

(i) u,v e {n-V2(e^^. . . , e^) : G { -1, 1 } , 1 ^ i ^ n } ifa>2, 
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(ii) u,v £ Sn-i if ce = 2, 

(in) u,v £ [eci : e & { —1, 1}, l^i^n}, if a < 2. 

Proof. As we already mentioned, 1 1 a; 1 1 is the supremum of the Hnear 

2 

form X £ M" acting on Sn-i (8) Sn-i- Proposition 9.1.7 and convexity 
of A1 implies I{dAi) = I,{Sn-i (8) Sn-i)- Moreover, Lemma 9.1.9 
asserts that I,{x) = 1/(q!|x|^) where + = 1. We can first 
calculate the points in Sji—i (8i Sji—i which minimize /, . This is rather 
easy since 



= \V\»'^\U\ 



1/3 

Thus, we need to locate the maxima of |it|^ on 5^-1- 

If a is larger than 2, then /? is smaller than 2. Therefore, 

Ge N^)"%(iE«?)"^ = ;;.> 

with equality if and only if \ui\ = l/-\/n for all z = 1,2, ...,n. 
Consequently, 



1 1 



sup { \u\/s : u G Sn-i } =ni^ ^ , if a > 2 . 

If a = 2, then /? = 2, and \u\p = 1 over all Sn-i- 
Finally, if a is smaller than 2, then /? is larger than 2. A unit vector 
u has all its components Ui between —1 and 1. Therefore, 



with equality if and only if one — and only one — of the |iti|'s is 1. 
Consequently, 



T f ^ \ ^ } if a > 2, 

al,{u^v)^i^ if a ^2; 



with equality for (u, v) = (tt,,, v*) with (u^,, u*) exactly in the following 
sets, 

\u*,i\ = \v^,i\ = l/Vn i = l,2,...,n, if a > 2, 

u^,v^ G Sn-i if a = 2, 

u^,v^ G { ecj : e G { —1, l},l^i^n} ifa<2. 
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For u and v in Sn-i, set 



u,v — 



{heM{n,R) ■.hu = h^' 



= } . 



For h in 



we have 



{u®v,u®v + h) = tr{uv^vu^ + uv^h) = 1 . 



Consequently, 



I,{u (g) v) = inf { I{x) : (x, u (g) f ) = 1 } 

^ ini { I{u 1^ V + h) : h e Hu,v , \\h\\ ^ 1 } . 



The inequahty 



® f*) ^ inf { I{x) : a; G 9Ai } . 



follows. Observe that (g) t)*) = (g for any a ^ 1. Since 
the function h E i— > /(u v + h) is convex, as a restriction of a 
convex function to a convex set, the infimum of / over dAi is achieved 
only at points x = (g f * . On those points /, and / coincide and this 



It is interesting to realize that the proof of Lemma 10.4.2 relics on 
the fact that /. and / coincide on the matrices u^0v^. Geometrically, 
the matrices u v + h, h e Hu,v with \\h\\ ^ 1 frorms a truncated 
cyhnder with base Sn-i <g 5'n-i- What makes the proof work is that 
Sn-i is the polar reciprocal of its convex hull; a very special property 
of the sphere! 

Let us now calculate all the terms that come from applying Theorem 
7.1. Wc will then justify that we can indeed apply this theorem in 
verifying that its assumptions hold. 

Let us first consider the case a> 2. From Lemma 10.4.2, we deduce 



concludes the proof. 



I{Ai) = n- 



2-a 



la. 



The rescaled dominating manifold 
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is of dimension k = 0. Its Riemannian volume is the counting measure 

where the last sum is over all distinct matrices {Vi^j /''''') i<i j<n ^^^^ 
coefficients in {— 1/n, 1/n}. Note that e r] and (— e) (8) (— ??) are 
equal, thus not distinct. 
For X in Va^ , we have 



4-2a 



n 

We then need to calculate the curvature term det Gai , and hence the 
fundamental form IlAj^^_^yU:,m* and n^Ai.M.^)?;, for (g) in P^i- 
From Theorem 10.3.4 with h = 0, we deduce that 

^aM = ['\'^ ||)GM(n^-l,R). 

On the other hand, the second fundamental form of ^i(Ai) at x is the 
restriction to the tangent space T^Ak^Ai) of 

Thus, if X is in we have \xij\ = 1/n and 

1 a-1 

= — 7^ Tid„2 = (a — lUd„2 . 

|D/(ar)| n2-« n"-2 ^ ^ 

Hence, for x in , 

nA,(^^,,c, - UgA,,x = {a- l)Id„2_i - ^^^2(n-l) 0^ 

(a - 2)Id2(„_i) 

(a - l)Id„2_2„+i 

Therefore, on Vai , 

detGAi(x) = (a - 2)2("-i)(a - l)"'-2n+i . 
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We obtain the constant ci in Theorem 7.1, 

1 X (n2+l)/2 



(27r)("'-i)/222"-i 



„(2-a)(n2+l)/2(Q, _ 2)"-l(a - 1)('»-1)V2 

Putting all the pieces together, 

PM^) ^ ( " ^ ^ J X 



2 



(27r)("'-i)/222"-i 



(2-a){n2+l)/2(Q, _ 2)'i-l(a - l)(n-l)y2 



n 



as t tends to infinity, which is the result. 

Let us now turn to the case a = 2. From Lemma 10.4.2, we conclude 

/(^i) = l/2. 

The dominating manifold 

'^Ai = Sn-l <8) Sn-l 

is now of dimension k = 2(n — 1). Since ^n-i x Sn-i is a double 
covering of T>a^ , the Ricmannian measure on Pyi^ is half the product 
measure on the product of 2 spheres Sn-i, each having the Rieman- 
nian measure obtained from the Lebesgue measure on M". 
On , we also have 



|D/(x)| = ( J2 Ml') 



1/2 , , 

= \u\\v\ = 1 . 



The computation of the curvature term det Gai goes the same way as 
for a > 2. Namely, we still have 

nA,(^^, = Id„2_i and Uqa, = (^^^2(n-l) 0^ . 
In particular. 



10.4. Norms of random matrices 



235 



It follows that 

is of determinant 1. Therefore, with the notation of Theorem 7.1, 
Again, taking all the above estimates into account, we obtain 



1 



P{ U\\ >t}r^ ^(27r)«"-i) )/2Vol(5„_i)2e-* 

as t tends to infinity. This is the result since Sn-i has volume 
2WVr(n/2). 

It remains to check the assumptions of Theorem 7.1. We already 
checked (7.3) and (7.4). Assumption (7.5) is clear as well since the 
curvature of 5^41 is bounded. ■ 

We can now apply Theorem 7.5 to obtain the following re- 
sult on conditional distributions. It is worth knowing that Si x 
51/1— Id, Id} is the usual Klein bottle. Hence, S'n-i <8) Sn-i = 
Sn-i X /Sn-i/l — Id, Id } is a 2(n — l)-dimensional Klein bottle. 

10.4.3. PROPOSITION. Let X = {Xij)^^-j^^ be a random matrix 
with independent and identically distributed coefficients, all having the 
density 

The conditional distribution of X/t given \\X\\ ^ t converges weakly* 
to a uniform distribution over 

(i) the Klein bottle Sn-i <8) ^^-i if a = 2, 

(ii) the 2^"^"^ matrices of the form it* (8) for \u*^i\ = \v*^i\ = Xj^fn, 
1 ^ i ^ n, if a > 2. 

Proof. It follows from the calculation of T>Ai made in the proof of 
Theorem 10.4.1 and Theorem 7.5. ■ 

Let us now turn to the problem of estimating the tail probability 
of ||X|| when the coefficients Xij of the random matrix X are inde- 
pendent and identically distributed with a Student-like distribution. 
Given the work done in the previous sections, this turns to be an easy 
problem. 
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10.4.4. THEOREM. Let X = ^ random matrix with 

independent coefficients, all having a Student-like distribution with 
parameter a. Then, 

P{\\X\\^t\ ast^oo. 

Ill II / ; 



Proof. Notice that the set 

At^ [x^ M(n, R) : ||x|| ^t) =tAi 

is the complement of a convex set, namely the ball of radius t 
centered at the origin in M(n, M) endowed with the operator norm. In 
M(n, M) = M"^ , the axial points of this convex set are all the matrices 
eE'^'^ , e G { —1, 1 }, 1 ^ i, j ^ n, which are of Euchdean norm 1. There 
are 2n^ such matrices. Apply Theorem 9.3.1 to obtain the result. ■ 

With no extra effort, we can also obtain the following result on 
conditional distribution. 

10.4.5. PROPOSITION. Let X = " random matrix 

with independent coefficients, all having a Student-like distribution 
with parameter a. The distribution of X/t given \\X\\ ^ t converges 
weakly* to the uniform, mixture of the distributions of the matrices 
eZE^'^ with e m { — 1, 1 } and Z having a Pareto distribution. 



Proof. Apply Corollary 9.3.4. The axial points of A\ are the 
matrices eE^'f where e is in {—1,1}. Those matrices are of unit 

2 

Euclidean norm in R" . ■ 

Notes 

The theory of random matrices has been evolving quite fast lately. 
Motivated by applications in physics and in operator algebras, impor- 
tant progress has been made on the asymptotic theory as the size of 
the matrix goes to infinity. Our fixed size viewpoint is quite different. 
Amazingly clever explicit calculations have been made in the Gaussian 
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cases and some of its variations. A classical reference is Mclita (1991). 
Another aspect driven by statistics concerns the Wishart distribution 
— see Johnson and Kotz (1972). 

I believe Lemma 10.2.3 is not new, but I have not found a reference 
for it. It is very similar to Theorem 2.1 of Rosiiiski and Woyczyhski 
(1987), as well as its proof. If we assume that the Xj's are symmetric, 
then Lemma 10.2.3 can be deduced from Rosihski and Woyczyhski 
(1987) in conditioning on the signs of the Xj's. But here, we assume 
only asymptotic symmetry of the tail. Therefore, the signs of XiS 
given \Xi\ large is only asymptotically distributed uniformly over 

Theorem 10.1.4 involves the volume of SO(n,M). It seems to be 
calculated in Marinov (1980). But, by ignorance, I have not been able 
to follow his proof. I don't know if Marinov's results give the vohime 
of SO(n, M) embedded in M" of if they give it up to a proportionality 
constant. 



11. Finite sample results 
for autoregressive processes 



Autoregressive models are among the simplest and most widely used 
models in statistical analysis of time series. Their classical theory 
deals mainly with their asymptotic behavior over a very large time 
period. In this chapter, we will see that these models are in fact much 
more subtle than usually believed. Our study will build upon results 
of Chapter 8. Our results will not exhaust the topic by any mean; 
they should be considered as an incentive for further study. 

11.1. Background on autoregressive processes. 

In order to describe the processes we are interested in, let us introduce 
the backward shift B on vectors. For a vector u = {ui, . . . , u„) in M", 
we write Bu = (0, ui, . . . , Let e be a mean zero random vector 
in M", with independent and identically distributed components. We 
say that the vector X in R" is an autoregressive process of order p 
with innovation e if for some 6 = {6i, . . . ,6p) in MP, with 6p not null, 
it satisfies the equation 

X = OiB'X + e. (11.1.1) 
In a perhaps more explicit form, that means 

Xl =£1 

X2 = OiXi + €2 

= OiXp + + • • • + OpXi + ep+1 

Xp-^-2 = 9iXp^i + 92Xp + • • • + 9pX2 + ep+2 

Xfi = 9lXn-l+ 62X^-2 + • • • + 6pXn-p + e„ . 

For statisticians, the main questions are on estimation and tests 
procedures for such models. This means that one observes the vector 
X, and knows that it is of the form (11.1.1) with the e^'s independent 
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and identically distributed. The goal is then to estimate ^, that is to 
guess its value based on the knowledge oi X; or to perform tests on 
9, that is to check if some assumption on 9 is compatible with the 
observed value of X. How is this done? 

Consider the nxp matrix X = {BX, B^X). Equation (11.1.1) 
becomes 

X = X9 + e. 

Thus, X is a point in the space spanned by X plus a random vector. 
A reasonable guess for 9 is 9ls such that X9ls is the projection of 
X onto the space spanned by BX, . . . , B^X. This is called the least 
square estimator of 9. Whenever X is of rank p, we have 

= {x'^xy^x'^x . 

This can be calculated solely on the observed X. 

Furthermore, notice that the {i, j)-entry of the matrix X^X is 

{B'X,B^X)= J2 Xk-iXk-j 

= ^ XrXj.-\i-j\ , 

l+li-iK''<n-(iAi) 

while the i-th coordinate of X'^X is {B^X,X). 

It is customary to define the empirical autocovariances of order 

A; < n as 

7„(A;)=n~^ ^ XrXr-k- 

Notice that 

{B'X, B^X) - n7n(|i - il) = ^r^r-K-il • 

n— (jAj)<r^n 

Therefore, whenever X^ = Op{l) as n tends to infinity, and i — j is 
fixed, we have 

{B'X,B^X) =n^n{\i-j\) + Op{l) as n ^ oo . (11.1.2) 

This explains why the most popular estimator of 9 is not 9ls but the 
following substitute. Define the matrix 
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and the vector 

In = (7nW)i^i^- 

If the process X„ is of order 1 as n tends to infinity, then (11.1.2) 
shows that 

{n-^X^X)-^ = T-^ + Op{n-^) 

while 

A'TX = 7„ + Op(n-i). 
Thus, instead of using 6ls, one tends to guess 9 by 

9n — Tn 7n • 

Actually, whether we use 6ls or On does not really matter much for 

our purposes. What we do care about is that X is a linear function of e 
as (11.1.1) shows. The autocovariancc 7ri(/c) is a quadratic form in X, 
as well as in e, a classical fact. It is then plain that tail probabilities of 
7n(A:) are relevant to statistics, and that chapter 8 provides the right 
estimates. 

Maybe in order to fully enjoy the results we are going to prove, one 
should know the basics of the classical theory. To make this text quite 
selfcontained, let us sketch it. To this end, define the polynomial 

e(z) = 1 - 5^ Oiz\ zeC. 
Equation (11.1.1) can be rewritten as 



Q{B)X = e. 

Denote by ri, . . . , the complex roots of 6. Then Q{z) = Yli^i^pi^— 

r~^z). We can write 1/Q{z) formally as a series. This is done 
conveniently by introdTicing the vector r = (ri, . . . -.rp). Whenever 
s = (si, . . . , Sp) belongs to Z^, we write \s\ = si + • • • + Sp and 
'^^ = nKi^^^'- We have 



i/e{z) = n (1 - ^T'^i)-' = n E 

k^O \s\=k 
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Substituting B for z, we can formally define &{B) ^. Writing = 
if i ^ s, one can then check that X = Q{B)~^e, that is 

fc^O \s\=r 

When all the roots are outside the unit circle, there exists a positive 
r) such that 

I J2 ^"1 ^ {^+V)~%s : \s\=k} 

\s\=k 

^ (1 + ??)~^ , ^ asfc^oo. 
(p- 1)! 

Therefore if the residuals have a tail which decays fast enough the 
distribution of Xn converges weakly* to that of Ylk^o(}2\s\=k'''~^)^k- 
A simple condition on the tail of for this convergence to hold is 




P{\ei\ ^ t}— < oo. 



A more stringent one is to assume that is integrable; in this case 
Xn converges in as well. 

If now some roots are inside the unit disk, we can first assume 
that ri is the unique root with smallest modulus; and therefore |ri| 
is less than 1. Then, (11.1.3) shows that the distribution of r"X„ 
converges weakly* to a nondegenerate limit. Since r'^ converges to 
exponentially fast, this amounts to saying that the process X„ 
explodes at exponential rate. Some complications occur if the smallest 
root is not unique, but Xn still explodes, essentially at an exponential 
rate. 

As a consequence, the asymptotic behavior of the empirical auto- 
covariances as n tends to infinity is very different according to the 
location of the roots with respect to the unit disk. 

In conclusion, the classical theory makes a great deal of the location 
of the roots of 6 with respect to the unit disk. And it is essentially 
all that it cares about, because only the behavior as the time n goes 
to infinity is considered. In the following sections, we will show that 
it is only a part of the overall behavior of these processes. 
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To end this section, let us examine this root question for autore- 
gressive models of order 1 and 2. 

For an autoregressive process of order 1, we write Xi = aXj_i + e^. 
If |a| < 1, this process is nonexplosive. This can be represented on 
the real line as follows. 

1^ 3 

-1 1 
Shaded region for the nonexplosive domain 

For an autoregressive process of order 2, we write it as X^ = 

aXi^i + bXi^2 + We need to determine where a, b should lie 
for the polynomial — ax — b to have all its roots within the Tmit 
disk. If + 46 is negative, the roots are complex, conjugate to each 
others. They are in the unit disk if and only if their product is less 
than 1, that is if —b < 1. If + 4b is nonncgative and a is positive, 
the largest root in absolute vahic is (a + \/a^ + 46)/2. It is less than 
1 if + 46 < (2 - a)2 = - ia + i, that is 6 + a < 1. One can 
argue similarly if a is negative, and we obtain the following triangular 
domain. 




a 



b = -aV4 



Shaded region for the nonexplosive domain 



11.2. Autoregressive process of order 1. 

In this section, we investigate the tail behavior of the autocovariances 
of autoregressive processes of order 1, 
Xi=ei 

Xn = aXn-i + en , n ^ 2 . 
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Recall that e = (ei, . . . , e„) and X = {Xi, . . . , Defining 

/ 1 . \ 



A 



a 

^2 







B = 



1. 

v 







1-0/ 



The empirical covariance of order k, 

nin{k)= XiXi_k = {X,B''X) = {A^B''Ae,e) 

is a nice quadratic form in e. li k ^ n, then jn{k) = 0. We assume 
from now on that k < n. When e has a heavy tail, the tail behavior of 
n7n(A;) depends on the value of a. It is given by the following result. 

11.2.1. THEOREM. Let X be an autoregressive process of order 
one, with coefficient a and independent and identically distributed 

innovations (4 having a, Student-like distribution with pa,ram,eter a. 
The following expressions are equivalent to P{n'yn{k) ^ t} as t tends 
to infinity. 

(i) If a = and k = 0, 

K,,^a^^-^y^2nt-^/\ 

(ii) If a = and k ^ 1, 

Ks,aa''2{n-k)+t-"logt. 
(Hi) k is even and a ^ 0, or k is odd and a > 0, 

1 _ a'^i a/2 

(iv) k is odd and a < 0, 



-a/2 



Kl^a'^t-'^ \ogt X 



9 _ «2(n+l)r„2(iV(j+fe)) _ „2(3y(i+k))\ „ 
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It is implicit in the statement that the function a t-^ (1 — a*)/(l — a^) 
is extended by continuity at a = 1. Its value for a = 1 is i/2. 

The striking fact is that odd and even autocovariances exhibit very 
different decays when a is negative; the former are of order the 
latter of order log t. 

Proof. Define the matrix C = A^B^A. We apply the results of 
chapter 8. We need to check if the largest diagonal coefficient of C is 
positive, zero, or negative. In order to calculate it, notice that 



and 



otherwise. 



('Dfc\.._ fl if A; + 1 ^ i ^ n and J = i — A;, 
'■' 1 otherwise. 

Consequently, 

, -1 2{n-i-k+l) 

^fcl^o ifaV 1 andn-i-A;^0, 

1 — a 

a^(n — i — k + 1) if = 1 and n — i — k ^ 0, 

ifi^n-A: + l. 

Assume a = and k = 0. Then C = Id. Statement (i) of Theorem 
11.2.1 follows from Theorem 8.2.1. 

If a is null and k is nonzero, the matrix C = has all its diagonal 
elements vanishing. We apply Theorem 8.3.1, calculating 

\{B%,, + {B%,r = 2{n-k)+. 

i:C'i^i=0 IsJi^n 

When k is even and a is nonzero, then Cj.j is positive for any 
1 ^ i ^ n — k. We apply Theorem 8.2.1, calculating 

l^i^n—k l^i^n—k 

This gives statement (iii) , after substituting n — i — A; + 1 for z in the 
summation. 

Statement (iii), when k is odd and a is positive, follows from exactly 
the same calculation. 
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Let us now concentrate on k odd and a negative. Then is 
negative, and so is Ci^i if 1 ^ i ^ n — k, while Cj^j vanishes if 
i ^ n — k + 1. Therefore, we apply Theorem 8.3.1. We need to 
calculate 

We have 

1 - a2("+i-(^v(i+fc))) 
1 — 

We obtain Cj^j by permuting i and j. This gives statement (iv). ■ 

How good are these approximations? Looking at the bound in 
Theorem 3.1.9 and how we derived Theorem 5.1, we cannot expect 
them to be good when we are integrating in a high dimensional space, 
that is when n is large. 

A plot of the approximations given in Theorem 11.2.1 does not show 
much, since all the probabilities go to as i tends to infinity. When 
comparing the tails, it makes more sense to look at the relative error. 
This leads to the plot 

\ogP{njn{k)^t} 

as well as the logarithm of the approximation. These should be 
approximately in linear relation with logt. Therefore, the plots 
below will show the function t i— P{7„(A;) ^ t} with both axes 
in logarithmic scale. Since we do not know a closed formula for 
P{njn{k) ^ t}., wc obtained this probability by simulation. We 
generated 100,000 replicas of e. As t increases, the estimate of the 
true probability is based on less and less points; the simulated curve 
tends to wiggle as t gets large. The theoretical approximation will 
be the smooth curve on the graphs. The parameters involved are 
k, n, a, a. We will only consider the autocovariance of order 1 in 
our simulations. We consider the sample sizes n = 10, which is very 
small, and n = 20, which is a common order of magnitude in some 
applications. We also consider probabilities of interest in applications, 
namely between 10~^ and 10~^. Recall that 5% is about 10~^-^°. 
The results are as follows. 
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Let us first see what happens when the errors have a Cauchy 
distribution, corresponding to a = 1. 

When a = 1, the two plots bellow show that the approximation is 
amazingly good. 




lO'^ 10^ 10^ lO** 10*^ 10^ lO** 10 

n = 10, a = 1, errors Cauchy n = 20, a = 1, errors Cauchy 



For a = 0.5 the approximation is also excellent. 




10" 10^ 10*^ 10^ 10^ 10*^ 10^ 10^ 

n = 10, a = 0. 5, errors Cauchy n = 20, a = 0. 5, errors Cauchy 



In the degenerate case where we need to apply Theorem 8.3.1, the 

coefficient a vanishes. The approximation is not as good as before. 
But taking into account that we are visualizing a relative error, it 
performs well enough to be of practical use. 
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For a = —0.5, the autocovariancc tends to be negative since 
7n(l) /7ra(0) is an approximation of a. Therefore we need to go further 
on the tail to have a good approximation. For n = 10, it is stih 
accurate enough to be of some practical interest. One can use the 
approximation to find critical values at levels less than 10~^-^ ^ 3% 
say. But as n increases from 10 to 20, the accuracy decreases. 



10-1 



10-1 




n = 10, a = —0. 5, errors Cauchy n = 20, a = —0. 5, errors Cauchy 



For a = —1, we need to go much further in the tail of the 
distribution of 7n(l) in order to have a positive quantile. The 

approximation is not accurate in the range of practical interest. As 
n increases from 10 to 20, the approximation cannot be used in 
applications. 
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10' 10^ 10^ 10' 10^ 10^ 

n = 10, a = —1, errors Cauchy n = 20, a = —1, errors Cauchy 



When a is negative, it makes more sense to approximate the lower 
tail. Given what we have done, it is a trivial matter. We state a result 
only in the form needed for our discussion. 

11.2.2. THEOREM. Let X he an autoregressive process of order 1, 
with coefficient a, and errors independent and identically distributed 
from a Student-like distribution with parameter a. If a is negative, 
then 

l^i^n— 1 

as t tends to infinity. 

Proof. With the notation of the proof of Theorem 11.2.1, we need 
to evaluate the upper tail of (— Ce, e). In the proof of Theorem 11.2.1, 
we shown that when k = 1 and o is negative, the matrix —C has its 
coefficients C„,„ vanishing, while Ci^i = a(l — a^("~*))/(l — a^) for 
z = 1, . . . n — 1. Apply Theorem 8.2.1 to conclude the proof. ■ 

There is not much point in reproducing here results on the approxi- 
mation of the lower tail. It is enough to say that it works as expected; 

that is, the approximation is very sharp when a, = —1 or a = —0.5. 
The pictures look identical to those for the upper tail with positive 
coefficient a. 
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As a increases, the approximation given in Theorem 11.2.1 degener- 
ates for positive values of a. For a Student-distribution with 5 degrees 
of freedom, that is a = 5, and n = 10, their use starts to be question- 
able. 




For a = 0, we appealed to Theorem 8.3.1, and the approximation 
is not so good. We would like to point out that this failure can be 
seen on a very simple example. Consider two independent random 
variables, X, Y, with density a/x'^'^^ over [l,oo). We can calculate 
explicitly the distribution of their product, 

P{XY ^t} = a [ x-'^-^P{ Y ^ t/x }dx 

= a [ Al)(ix 

a log t 1 
= — H . 

Our approximation picks up the leading term, at~°' log t as t tends 
to infinity. But the growth of logt is too slow for the first term to 
really dominate in the range where the probability is of order 10~^ 
or 10~^. One cannot expect a good one-term approximation in such 
case, except if a is large. This suggests that when a is zero, our 
approximation may improve when a increases. This is the case. And 
of course, for negative a, it becomes worse, positive values of the 
empirical covariance being less and less likely. 
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IQi IQi-^ IQi'* W^-^ 10-1 1 



n = 20, a = 0, errors Student(5) n = 20, a = —0. 5, errors Student(5) 

For a = 0, the approximation is fairly good, until we have many 
moments on the distribution, that is if a is large enough. Then 
assuming simply that the errors are normally distributed may give 
a better approximation. 

10-1 



io-2__ 



lOi 101-2 lOi-" 

n = 20, a = 0, errors Student(lO) 

When our approximation is not so good, one can think of some 
alternative techniques. Besides the classical Edgeworth expansion — 
which is poor in term of relative error — one could also approximate 
the Student distribution by a normal one, and then proceed as if e 
were normally distributed. This works well if the normal distribution 
has the "right" variance. But one has to be aware that for a small, the 
right variance is not that of the corresponding Student distribution. 
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For instance, for the Student distribution with 5 degrees of freedom, 
the normal approximation using a variance equal to that of the 
Student distribution poor. One needs a much larger variance. I tried 
to approximate the Student distribution by a normal with the variance 
such that some quantile of the normal would be equal to that of the 
Student. This does not work any better, in the sense that there is 
no systematic way to do this kind of calibration. One should also 
be aware that the symmetry in the Student distribution makes our 
approximations less precise. They would be more accurate if the errors 
had a centered Pareto distribution for instance. Obviously more work 
is needed to derive a set of approximations which would cover more 
or less any regime. It is doubtful that a single approximation scheme 
can give satisfactory results under a very broad class of distributions 
for the errors and relatively arbitrary sample size. 

Classically, a is estimated by d = 7„(l)/7„(0). Let us now consider 
the test problem 

Ho : a ^ ao, versus Hi : a > ag. 

A possible way to perform this test is to reject the null hypothesis 
if d„ — ao is too large, that is if 7„(1) — ao7n(0) is too large. For a 
reason which will be explained in the proof of the next result, it is 
better to use 

7„(0)=n-i Yl 

instead of 7n(0). As n tends to infinity, the classical theory ensures 
that it does not make any difference. However, it does make a 
difference for our finite sample results. The theory is much nicer with 

7n(0). 

The test statistics is again a quadratic form in e. The following 

result gives its tail approximation under the null as well as under the 
alternative hypothesis. As one more parameter is involved, another 
behavior appears. 

11.2.2. THEOREM. Let X be an autoregressive process of order 1, 
with coefficient a and independent and identically distributed innova- 
tions, all having a Student-like distribution with parameter a. The tail 
probability 

P{ n(7„(l) - ao7n(0)) >t} 
admits the following equivalent as t tends to infinity. 
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(i) Ifa> ao, 

l^i^n— 1 

(ii) If ao = a and both are nonnegative, 

with the convention that O'' = 1 when a = 0. 
(Hi) If ao IS positive and a < ao, 

c{ao, a, a, n) t~" 

for some function c(-). 

In case (iii), we will explain after the proof how to calculate the 
function c(-) in a typical case. 

Proof. Write n(7„(l) - ao7„(0)) = e^Ce with 

C = A^^BA - aoA^B'^BA . 

From the proof of Theorem 11.2.1 we obtain the diagonal terms 
{A^BA)i^i. We calculate 

{A-B-BA),, = Y: iBA)l = E = • 

Consequently, 

{1 „2(n-i) 
7" ifl^z^n-1 
if i = n. 

If instead of using 7^(0) we use 7n(0), the term {A^^ B^ BA)i^i has 
denominator 1 — a^^""*"^^). The discussion is a bit more involved. The 
result becomes more dependent on n. It is more complicated to state, 
but it does not make much difference as far as the theory goes. 

If a — ao is positive, then all the diagonal coefficients of C but Cn,n 
are positive. We apply Theorem 8.3.1. 
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If a — ao is negative, only Cn,n is nonnegative. We apply Theorem 
8.3.1. Since we will need it, let us calculate Cjj. First 



Therefore, 



if 2 ^ A; ^ n, 
if A; = 1. 



a2n _ ^2{iyj) 



^ otherwise. 
Consequently, a little algebra shows that 

{A^B^BA)ij = {BA)k,iiBA)kj = a^''^ 

In particular, for i < n, 

r ■ — 

while Ci^n = for 1 ^ z ^ n. Therefore 

If a and ao are equal, then all the diagonal coefficients of C vanish. 
In this case, for i, j distinct, 

Ci,i = «' ' ' 1 2 «oal* " 2—^ • 

1 — a a — 1 

If ao is positive and strictly larger than a, then all the diagonal 
coefficients of C arc negative. We need to determine N{C) and apply 
Theorem 8.2.10. The statement follows from Theorem 8.2.21. ■ 

Some useful information can be deduced from Theorem 11.2.2. 
A first qualitative deduction is that if ao is positive under the null 
hypothesis, then the tail probability under consideration has a very 
different decay according to the position of a with respect to cq. Thus, 
one should probably not use symmetric confidence intervals. It may 
be wise to have a somewhere on the right half of the interval. 

Next, assume that we want to test with the risk of first type t/ very 
small. We can use the approximation under the null hypothesis to 
obtain the critical value. Define 



c(a) = 2K.,.a^(a-aor/^ E , 1 - a^(-^) ^ a/2 

l^i^n—l 



11.3. Autoregressive processes of arbitrary order 



255 



We see that if ^ ao < a, then rj ~ c{a)/t°'^^. This gives an 
approximate critical value = {c{a) /t])^^" . 

The following plot shows the actual risk of the first type when using 
the approximate critical value. The "true" value of the risk is obtained 
by simulation, replicating 100,000 copies of the vector e. The curves 
were obtained by linearly interpolating between the following values 
for a. 

0.500 0.552 0.605 0.657 0.710 0.762 0.815 0.868 0.920 0.930 
0.939 0.948 0.957 0.966 0.975 0.984 0.993 1.000 1.002 1.011 
1.019 1.028 1.037 1.046 1.055 1.064 1.073 1.082 1.091 1.100 
1.150 1.200 1.250 1.300 1.350 1.400 1.450 1.500 

The sample size is n = 20. The lowest curve is for a = 1, the two 
almost equal curves are for a = 5 — the lower one of the two curves 
— and a = 10. The value a = 1 is indicated, as well as the levels 5% 
and 10% . 



1 




n = 20, a = l, 5, 10 

The result is satisfying at first glance. A more careful examination 
shows some reason to worry. For a = 1, the actual risk of the test 
is about 10% when a = 5 or 10. This is still small, but in terms of 
relative error, this is twice as much as what we wanted. The fact that 
the power function grows moderately fast with a is not a surprise if 
you plot some of these processes. On a trajectory of length 20, and 
with errors having a Cauchy distribution, an autoregressive process 
of order one with a — 1 looks very similar to one with a = 1.2 for 
instance. 

11.3. Autoregressive processes of arbitrary order. 

In principle, all the results of the previous section can be generalized 
to autoregressive processes of arbitrary order. As more parameters are 
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involved in the model, the analysis is harder. Thus, our goal is rather 
modest. We will prove that the tail of the autocovariance of order 1 
has different decays according to the values of the parameters, and the 
number of observations as well. Since autoregressive processes of order 
one are a degenerate case of those of higher order, the higher order 
autocovariances would have an even more complex behavior. For any 
given value of the parameters, the results of chapter 8 can be used to 
do numerical computations; this is quite easy, and sometimes helpful, 
but does not provide further insights. For autoregressive processes 
of order 2, we will obtain some rather precise results, showing the 
intricacy of these models. 

Let us now consider an autoregressive model of order p as in 
(11.1.1). The parameter 9 = {9i,...,9p) is in W. Our first result 
shows that can be partitioned into two regions, one where the tail 
behavior of Ti,7„(l) is like t^"/^, the other one where it is like t^" logt. 
The noticeable fact is that there cannot be other tail behavior, and 
these regions are nested when the number of observations varies. 

11.3.1. THEOREM. Consider an autoregressive process of order p 
with errors having a Student-like distribution with parameter a. There 
exist nonempty semialgebraic sets R}~, k ^ 1, of MP, and a positive 
function c(-) on W, such that 



c{9)t-/^ ifee[ji^,^n-iRk, 
cie)t-\ogt ife^[ji<k<n-iRk. 



REMARK. The regions Ri depend of course on the dimension p of 
the parameter space. It is remarkable that they do not depend on n, 
though the quadratic form representing n7n(l) does. As n increases, 

Ui<fe<n increases, but we will see during the proof that its limit is 
a proper nonempty subset of M*'. The proof also gives some indications 
on how to calculate these regions efficiently. 

Proof of Theorem 11.3.1. Wc write X — At. The matrix A is more 
involved than in section 2. Let be the i-th vector from the canonical 
basis of M'', that is having all its entries vanishing, except the i-th one 
being 1. We denote Ai^, the i-th row of A. Relation (11.1.1) gives 



M,, = ei 

A2,. = ^1^1,. + 62 
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Ap+i^, = OiAp^, + ^s^p-i,. + • • • + epAi,, + Bp (11.3.1) 

and for p + 1 ^ A; ^ n, 

Ak,, = 9iAk-i,, + e2Ak-2,. + ■■■ + 9pAk-p,. + Bk . (11.3.2) 

Let C = A'^BA be the matrix of the quadratic form such that 
n7„(l) = e^Ce. Since Aj^i = for any 1 ^ j < i ^ n, 

Cn,n = and max Cj^j ^ . 

Thus, Theorems 8.2.1 and 8.3.1 show that the only possible tail 
behaviors of n7„(l) are either like or like t^^logt. They show 

as well the existence of the nonvanishing function c(-). 

Since Aij = Aj+ij+i for all i, j ^ n — 1, we have for A: ^ 1, 

Cn—kjU—k ~ ^ ^ {A ^:fi—i;^iBijAj^ji—i; 
~ Aj^n-kAj-l^n-k 

= (11-3.3) 

i<ij<ik 

Relations (11.3.1) and (11.3.2) show that Aj^i is a polynomial in a 
and 6. Hence Cn-k,n~k is also a polynomial in a and b. The region 

= { (a, 6) e R2 : C„_fe+i,„_jt+i ^ } 

is then a semialgebraic set, which does not depend on n. Moreover, 
the largest diagonal coefficient of C is positive if and only if (a, b) is in 
Ui^ifc^n ^k- The result then follows from Theorems 8.2.1 and 8.3.1. ■ 

In practice, to decide in which region we are, we can numerically 
compute the diagonal terms of C with formulas (11.3.1)-(11.3.3). But 
if one ultimately wants the constant c(a, 6), other elements of C are 
needed when the tail is like t^" log t. 
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The sets Rk do not seem easy to describe in general. For applica- 
tions, this may not be so important, since numerical computation is 
easy to implement. Understanding their geometry amounts to under- 
standing the behavior of the roots of inductively defined polynomials 
in the p variables 9i,. . . ,0p. More can be said when p = 2, because 
polynomials of degree 2 are well understood. And this is enough to 
show how intricate these autoregressive models are. Thus, from now 
on, we focus on autoregressive models of order 2. We change slightly 
the notation, using (a, 6) instead of (^1,^2)- Thus, our model is 

Xi = ei, 

X2 = aXi + €2 , 

Xk = aXk_i + bXk_2 + ek, 3 ^ k ^n. 

We can explicitly write down R2, R3, R4. Indeed, Cn,n = and 
thus i?i = 0. We then have 

Cn-l,n-l = ^2,1^1,1 = a, 

thus 

R2 = {{a,b) : a > } . 
Since ^13,1^2,1 = (a^ + b)a, 

Cn-2,n-2 = {o^ + b)a + a = a{a^ -I- 6 -I- 1) . 
Consequently, 

i?3 = { (a, 6) : a > and b> —1 — c? ; or a < and 6 < — 1 — } . 

But what matters more, 

i?2 U i?3 = ((0, 00) X M) U { (a, 6) : a < , 6 < -1 - } . 

The next figure shows these regions. 

We also obtain ^4^1 = a(a^ -|- 26), which leads to 

Cn-3,n-3 = a{a^ + 26) (a^ + b) + a(a^ + b + l) 
= a{2b^ + 6(3a^ + l) + a^ + + l) 

To obtain i?2 U i?3 U i?4, we need to see when C„_3^„_3 is positive as 
a is negative. In other words, when a is negative and 



26^ + h{2,a^ + 1) + 1 + + < . 
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This holds if a < ^ and 



with 



Thus, 



b± = 



-(1 + 3a2) ± Va^ - 2a2 - 7 



i?i U i?2 U i?3 U i?4 = ((0, oo) X R) lj{ (a, 6) : a < , 6 < -1 - } 

1 - 2^2 



[j{{a,b) : 



a ^ 



; 6_ ^ 6 ^ 6+ } . 




Regions R2, R2 U R3, R2UR3U Ra- 

To calculate i?5, we need to solve a cubic equation in b. Closed 
form expressions are getting more and more cumbersome, and even 
nonexistent. The following picture shows Ui^fc^n -^fe ^^^^ ~ 5,6,7 
and n = 10 in the domain a negative. 
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b b 




n = 7 n = 10 



These pictures can be more or less understood theoretically. This 

is the purpose of the next result. Its proof contains even more 
information, some of it being important, and we will discuss further 
after the proof. 
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11.3.2. THEOREM. The closure o/|J^>]^ Rk contains all points (a, 6) 
for which one of the following conditions holds: 

(i) a > 0, 

(ii) a ^ and b < —a} — 1, 

(Hi) a ^ and b < min(— a^/4, —a — 1). 

The region described by the three condition in Theorem 11.3.2 is 
shaded gray in the following picture. It does not contain its boundary. 

b 




b = b=-a^-l 



It follows from Theorems 11.3.2 and 11.3.1 that if {a,b) lies in the 
gray shaded region, the tail behavior of P{n7n(l) t} is typically 
like for n large enough. I believe that in the nonshaded domain, 

the tail behavior is like log t; we will prove this only when b ^ 

Proof of Theorem 11.3.2. The proof will be done in examining 
different regions. We will need several lemmas, and will actually prove 
much more than the statement. 

11.3.3. LEMMA. If a is positive, so is the largest diagonal coefficient 

of C. If a is nonpositive, then the largest diagonal coefficient of C 
vanishes. Consequently, Ufc^i contains the region a > 0, but does 
not intersect the region a ^0 andb^ -a'^/A. 
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Proof. If a is positive, then Cn-i,n-i = a is positive too. To see 
what is happens when a is nonpositive, denote by u, v the roots of the 
characteristic equation — ax — b = 0. Equations (11.3.1)-(11.3.2) 
yield 

with initial condition Ai^i = 1 and = a. Thus r and s are 

determined by 

r + s = 1 and ru + sv = a . 
If u and V are distinct, that is 6 / — 



— 



u — V 



If 6 = -a? /A, we find 



j(a/2) 



Consequently, if 6 7^ —a'^/A, 



{u — v)"^ 



(11.3.4) 



Notice that 



{u - v f = (u + v f - Auv = + 46 . 
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Let us now assume that a is negative and h is positive. Then + 
is positive. There is no loss of generaUty in assuming u <0 <v since 
the product of the roots, —6, is negative. The inequaUty 



u" — V = {u + v)(u — v) = —a\l + 46 ^ 



,2 „,2 



forces |u| ^ \v\. Consequently, the sign of Aj+i^iAj^i is that of 

which is negative. Therefore, the sequence decreasing 
and 

max Ck,k = Cn,n = • 

If we now assume that a is negative and — < 6 ^ 0, the roots 
u, V are still real, but both are negative. Thus 

^^^^'^"^^'^ ^^Tl6 

is nonpositive since the function x and a; i-^^ x^'^^ are increasing 
on R"*". The sequence i— > Cn-k,n-k is decreasing and its maximum 
is C„^„ = 0. This gives Lemma 11.3.3. ■ 

The region left is a ^ and 6 < — a^/4. Our next lemma covers a 
part of it. 

11.3.4. LEMMA. // a ^ and b ^ —1, then the largest diagonal 
coefficient of C vanishes. Therefore, Ufe^i -^fe does not intersect the 
region a < and b ^ —1. 

Proof. If + 46 is positive, the result follows from Lemma 11.3.3. 
If + 46 is negative, equation (11.3.1)-(11.3.3) gives 

Cn-k,n-k = ^2,1^1,1 + ^3,1^2,1 + Yl ('^^J-l + bAj-l,l)Aj 

= bCn-k+l,n-k+l + a ^ A^ . 

l^^k 

Consequently, for ^ 2, 

Cn-k,n-k = b'^Cn-k+2,n-k+2 + a{b + 1) ^ Aj + aAl . 
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Thus, if 6 ^ — 1 and a ^ 0, 

Cn-k,n-k ^ ^^Cn-fc+2,n-fc+2 • 

Since C„,„ = and Cn-i,n-i = a < on the given range, this shows 
that Cn-k,n-k ^ for all 0. ■ 

To study the domain a ^ 0, 6 ^ — 1 and b < — is much more 
complicated. We assume from now on, and until the end of the proof 
of Theorem 11.3.2, that (a, 6) is in this domain. It is then convenient 
to make a change of parameterization, setting 

a = -2rcos^, 6 = -r^ r^O, 0^(/)^7r/2. 

First, this allows us to obtain a closed formula for the diagonal 
coefficients of C. 

13.3.5. LEMMA. If a = —2rcos(j) and b = — r^, with r nonnegative 
and cp in [0,7r/2], then 



Cn-k+l,n-k+l — 1" 

+ r 



-2(l-r2*^)cos(/)sin2</) 

2(fc-i) (1 _ ^2-) gin(A;</,) sin {{k + 1).^) - sin {{k - 1)(/))) 
(1 - r^)((l - r^f + Ar^ sin^ cf) sin^ cj) . 



Proof. Write u = re and v = re for the roots of the 
characteristic equation — ax — b = 0. Setting (p = tt — 6, we 
obtain 

a = u + v = 2r cos 9 = — 2r cos (f) 
b = —uv = — . 

This is the origin of the parameterization. Equations (11.3.3) and 
(11.3.4) give for k ^ 2, 

sr^ -{u + v){uvy + 

Cn-k+l,n-k+l - 2^ / _ )2 

1 / ol -u2(^-l) ^ ^ 1- (to)*^-1 
= "2 U — ; 2 [U + V)UV 

1 _ t,2(fe-l) . 
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In this last expression, a part is independent of A;. It is the ratio of 

u^{l-uv){l-v'^)-{u + v)uv{l-u'^){l-v'^) + v^{l-u'^){l-uv) 
= {u — v)'^{u + v) = — 4r^ sin^ 6 2r cos 9 

and 

{u - vf{l - u^){l - v'^){l - uv) 

= {u-vf{l-uv){- (u + + {1 + uvf) 

= - (2r sin ef{l - r^) ((1 + r^)^ - 4r^ cos^ 6) 

= - sin^ e{l- r^) ((1 - r^)^ + sin^ 6) . (11.3.5) 

For the part dependent on k, we reduce it to the same denominator, 
(11.3.5), and obtain the numerator 

(1 -uv){- - ^;2fc+l + „2^2(^2fc-l ^ ^2fc-l)) 

+ (n + v){uv)''{l - n2)(l - ti2) 

= -(1 - r2)(2r2'=+i cos(2A: + 1)9 - r*2r'^^-^ cos(2A; - 1)6) 

+2r cos 6* r^^ ((1 - r^) + 4r2 sin^ 9) . 

Adding the part independent of k and that dependent of A;, we obtain 
the numerator 

-8{r^-r'^''+^)sm'^9cos9-2r'^''+\l-r'^){cos{2k+l)9-r'^ cos{2k-l)9 

- {l-r^)cos9) 

= -8(r^ - r'^^+^) sm'^9cos9 + Ar'^''+\l - r^) sinfc9(sin(fe + 1)^ 

-r2sin2(A;- 1)^) . 

Consequently, 

Cn-k+i,n-k+i = r 2(1 - r^'') sin^ 6'cos6' 

-r'^(^-^){l-r^)smk9{sm{k + l)e-r^sm{k-l)9) j 

(1 - r^) ((1 - ?f + 4r2 sin^ 9) sin^ 9 . 

The change of angle 9 = it — (j) gives the result. ■ 

In the domain a ^ and 6 < — 1 with < — a^/4, that is r > 1 
in our (r, 0)-parameterization, Lemma 11.3.5 shows that the sign of 
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Cn-k+i,n-k+i is that of minus its numerator. Thus, it has the same 
sign as 

2(1 - r^'^)cos0sin^^ 

_ _ r'^-js[Tak(j){sm{k+ l)4> - r'^ sm{k - 1)(/)) . 

In other words, the sign of Cn-k+i,n-k+i is that of 

2 cos ^ srn^ (j) + r^^^-^^ gkir"^) (11.3.6) 

where 

gk{s) = —sm(k4>) {sin kef) cos 4> + cos k4> sin 4>) 

+ s{—2 cos (f) sin^ (p + 2 sin^ k(f) cos (p) 
— sin A;0(sin kcp cos cj) — cos kcp sin cj)) . 

We can now explain how to conclude the proof. As k tends to infinity, 
the leading term in (11.3.6) is r'^i''^^^ g{r'^), since we assume r > 1. 
Thus, for large k the diagonal coefficient Cn-k+i,n-k+i is positive 
whenever ^^(r^) is such. Thus, our goal is to determine for which 
values of (s, ^) we can have gk{s) positive for infinitely many fc's. 

The trick is to understand that when cj) is an irrational multiple 
of 27r, the sequence sinA;^ fills [ — 1, 1]. Thus, we can consider sinA;^ 
almost as a free parameter, on which we can optimize. This leads us 
to define the function 

h{9) = — sin (sin 9 cos (/) + cos 9 sin (/)) 

+ s(— 2 cos (/) sin^ (j) + 2 sin^ 9 cos (j)) 
— sin 9{sin 9 cos 4> — cos 9 sin (j)) . 

Formally, this function is obtained by substituting kcj) for 9 in the 
expression of gk- It is convenient to define 

^ = (s — 1)^ cos^ , B = {s^ — 1) sincj) , and C = 2s sin^ (/!) cos ^ . 

Since we implicitly made the change of variable s = after (11.3.6), 
the numbers A, B and C are all positive — recall < 4> < ir/2 and 
r > 1. 

The following result will be instrumental. 
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11.3.6. LEMMA. The function h{9) is maximal at a point 9* , unique 
modulo TT, and defined by 

cos(2r ) = A/^/A^Tb^ , sm{2e*) = Bj^A^ + B'^ . 

Its maximum value is 



Proof. We rewrite the function h{-) as 

h{9) = -A sin^ 9 + BsmdcosO - C 

1- cos 26* „sin26' _ 

= -A + B— C. 

2 2 

Differentiating with respect to ^, we see that when h is maximum, 

= h{9*) = -A sin 29* + B cos 29* . 

Since neither A nor B vanish, this gives us tan^* = B/A. Conse- 
quently, there exists ei, 62 equal to either —1 or +1, such that 

cos2r = eiA/y/A^ + B^ and sin2r = e2B / + B^ . 

At such point, the value of /i(-) is 

hi9*) = -^ + '-4±^-C. 
^ ' 2 2^WTW- 

It is maximum when e\ = 62 = 1. This determines 29 modulo 27r, and 
therefore, 9 modulo tt. ■ 

We can now determine when the maximum of /!,(•) is positive. 

11.3.7. LEMMA. In the domain a < and 6 < -1 with b < -a^/A, 
the function h{-) has a positive supremum if and only if b < a — 1. 

Proof. Using Lemma 11.3.6, the positivity of the supremum of /i(-) 

is equivalent to 

\/^2 + B'^> A + 2C . 
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Since A, B and C are positive, this is equivalent to > 4C^ + AAC. 
Plugging the expression for A, B and C into this last inequality, we 
obtain 

(s^ - 1)2 > 16s2(l - cos^ (f)) cos^ (/) + 8s(s - 1)^ cos^ </> . 
Setting c = cos^ (f), we obtain a quadratic inequality 

IGs^c^ - 8s(s2 + l)c + (s^ - 1)2 > . (11.3.7) 
The quadratic function of c involved has two positive roots, 

c_ = — and c+ = — . 

4s 4s 

Since c+ > 1 (recall s = > 0), inequality (11.3.7) is equivalent to 

cos^ <p < C- . 

Going back to the parameterization a = — 2rcos0 and b = — r^, that 
is = 4scos2 cf) and 6 = — s, we rewrite the above inequality. After a 
simplification by 4s, it gives 

Since 6 < — 1 and a < 0, it is equivalent to 6 < a — 1. ■ 
We can now state our final lemma. 

11.3.8. LEMMA. Assume that a <0 andb < min(-a2/4, -1). Ifb < 

a—1 and (p is an irrational multiple of 271, then limsup^,^3Q Qkif"^) > 0. 
On the other hand, if b > a — 1, then there exists a positive e such 
that gkir^) ^ — e for all k ^ 1. 

Proof. Assume b < a — I. Combining Lemmas 11.3.7 and 
11.3.8, let e be a positive number such that h{9) is positive on an 
e-neighborhood of 6*. If (/) is an irrational multiple of tt, the sequence 
k(p intersect [9* — e,6* + e] + 27rZ infinitely often; this follows from 
Kronecker's approximation theorem in number theory — see, e.g., 
Hlawka, SchuiBengeier and Taschner (1986). Consequently, 

lim sup gkir"^) = h{9*) > . 

fc— >oo 
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If b > a— 1, then h{-) is a negative function. Since gk{r'^) = h{k(j)) ^ 
h{0*), the result follows. ■ 

To conclude the proof of Theorem 11.3.2, we still assume a < 
and h < min(— — 1). lib < a — 1, Lemma 11.3.8 shows that 
whenever (j) is an irrational multiple of vr, the limit superior of (11.3.6) 
is +00. For such values of 0, the pair (a, 6) is covered by infinitely 
many regions R^- Thus in the range h < a — 1, the only regions 
not eventually covered by Ufc^i -^fe those for which ^ is a rational 
multiple of vr. After the change of parameterization, the complement 
of this potentially uncovered set is dense in 6 < a — 1. 

If 6 > a — 1, then (11.3.6) is less than 

2 cos(/)sin2 (j) + r2('=-^)/i(r ) , (11.3.8) 

which tends to — oo as k tends to infinity. Therefore, the pair (a, b) can 
be covered by at most a finite number of regions Rk. This concludes 
the proof of Theorem 11.3.2. ■ 

Notice that we proved much more than the statement of Theorem 
11.3.2. When a < and b > — Lemma 11.3.3 shows that no 
region R^ covers (a, 6). 

The proof of Lemma 11.3.4 also contains useful information. If 
a < and b > — a^/4, the sequence k Cn-k,n-k is decreasing. 
Thus, the constant c{a,b) in Theorem 11.3.1 and given by Theorem 
8.3.1 is 

c{a,b) = Kl^a^ \Cn,j + Cj,nr. 

This expression simplifies further if one notices that 

Thus, Cn,j = An- j -2,1 for 1 ^ j ^ n — 1. Moreover 
Cj,n = for all 1 ^ j ^ n . 

Consequently, 
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In the range a < and a — 1 < 6 < — the proof of 
Lemma 11.3.8 shows that at most a finite number of regions cover 
(a, b). Notice that the bound (11.3.8) shows that the number of such 
covering regions is at most the largest k for which (11.3.8) is positive. 
Ultimately, this gives an inequality in k, a and b. Potentially, this 
could be used if someone were interested in proving some result for 
particular values of a and b. However, the pictures above suggest that 
no region Rk covers such pair a (a, b); but I don't know how to prove 
it. 

The proof of Lemma 11.3.9 involves a number theoretic argument 
which does not say what happens when 4> is a rational multiple of 
TT. The pictures of the regions Ri below leave the possibihty that 
some exceptional parabola a = — rcos0, b = —r^ with (j) G 27rQ 
are left uncovered. EqTiation (11.3.6) shows that no parabola is left 
completely uncovered. Indeed, as r tends to infinity, the sign of 
Cn-k+i,n-k+i is that of sin kcp sm{k — l)(p. We claim that the sequence 
smk(j)sm{k — contains infinitely many positive values whenever <p 
is in (0,7r/2). Indeed, if ^ is in (0,7r/2), then k^ and {k — l)<p are 
less than 7r/2 apart. When (j) is a rational multiple of vr the sequence 
{k(t))k^i is periodic modulo 2tt. Consequently, for some fc, both /c0 
and {k — 1)4) are in (0, vr) modulo 27r. For this specific k we have 
sin kef) sin(A; — !)(/> > 0. 

1 conjecture that Theorem 11.3.2 is sharp, meaning that the region 
described in (a, b) coincides with (Jfe^i -^fe- 

Combined with our description of i?i = { (a, 6) : a > }, Lemmas 
11.3.4 and 11.3.5 allow us to describe completely what happens in the 
stability region. 

11.3.10. THEOREM. For the second order autoregressive process, 

Xn = aXn-1 + bXn-2 + ^n, assume that the roots of the characteristic 
equation — ax — b = are inside the unit disk. Then the tail of 
n7n(l) has the form 

P{njn{l) > t} ^ \ ' ' ast^oc. 

I c{a, b)t " logt ^/ o < 

In particular, in the stability region, the tail of n7n(l) behaves 
like if a > and like t~"logt if a < 0. In some sense, this 

generalizes Theorem 11.2.1 to autoregressive processes of order 2. If 
6 = we recover Theorem 11.2.1. 
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Notes 

A basic reference on the classical theory and applications of time series 
is Brockwell and Davis (1987). 

For linear processes with heavy tailed errors, Davis and Resnick 
(1986) developed the asymptotic theory as the number of observations 
goes to infinity. A slightly different perspective, first letting n tend to 
infinity, and then looking at the tail of the limiting distribution has 
been investigated in a series of papers by Mijnheer (1997a, b, c). 

When the errors ej's have a nearly symmetric distribution and 
enough moments, it is tempting to work as if they were from a Weibull- 
like distribution, or even from a normal one. In the later case, the 
autocovariances are weighted sums of chi-square random variables. I 
don't know how to assess the accuracy of such an approximation. 

I somewhat believe that a proper understanding of the regions Ri 
in general requires adding some algebraic geometric tools. Cox, Little 
and O'Shea (1992, 1998) may be a good starting point for statisticians 
interested in pursuing this research path. I don't know what is the 
analogue of Theorem 11.3.2 for autoregressive models of order larger 
than or equal to 3. 

One may think that our choice of heavy tailed errors is the origin 
of the complicated tail behavior of the autocovariances. If the Cj's 
have a spherical distribution, we can use the classical Gaussian trick 
of diagonalizing the matrix of the quadratic form. When looking at 
the tail behavior, one is then led to the nontrivial question of relating 
the dimension of the largest eigensubspace of the matrix C to the 
parameter 9 of the autoregressive model. A plot of the spectral gap 
for the matrix C as a function of a and b — thus, for autoregressive of 
order 2 — suggests an incredibly complex behavior of the tail of the 
the aucovariance, even with Gaussian errors. 



12. Suprema of some 
stochastic processes 



In chapter 9, we studied how the tail of the distribution of the 
supremum of a random hnear form is related to integration over some 
asymptotic sets. The goal in this chapter is to go a little further, 
considering some examples which are of pedagogical interest. 

12.1. Maxima of processes and maxima of their variances. 

Consider a centered Gaussian process X{m) indexed by some abstract 
set M. As in chapter 9, write 

X{M) = sup{X{m) : me M} 

for its supremum. Define 

(T^(M) = sup{ VarX(m) : m € M} 

to be the supremum of its variance. A famous result of Fernique 
(1970), Landau and Shepp (1970) asserts that whenever a^{M) is 
finite, 

limt-logP{X(M)^n = -^- (12.1.1) 

This result is often interpreted in saying that, in the logarithmic 
scale, the tail of the supremum of the process is driven by the points 
of largest variance. A heuristic argument of why this should be the 
case is that when the variance is large, the process tends to fluctuate 
more. And so, we should expect its supremum, when large, to be near 
such a point. As it is, this heuristic argument could be applied to 
any process. The aim of this section is to show that this heuristic is 
wrong. 

We are going to construct some processes whose supremum tail 

is driven by the points of smallest variance. This is an application 
of ideas developed in chapter 9. The key is to understand why 
the maximum variance appears in the Gaussian case. Proposition 
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9.2.1 tells us that this happens because /. is proportional to l/|a;p 
in the Gaussian tends to infinity. That is /. is inversely 

proportional to the Euclidean norm — a very specific feature. And 
the Euclidean norm is a monotone function of the variance. Thus, 
to build a counterexample to the heuristic, we need to have a set M 
and a function / such that /, is minimal on M at points of minimal 
Euclidean norm. 

Let a be in (1,2) and 9 = {T{l/a)/T{3/a))"^^ . The function 

fa{x) = " =e-^l^l" , X e M , 

2yr(l/a)r(3/a) 

defines a density. It has zero expectation, and unit variance. For any 
positive P, define the unit sphere for the ^^-norm 

Sf^ = {xeW^ : \x\i3 = 1 } . 

If /? > 1, define also 

Mf3 = { sign{mi)\mif-^ei : m e sf^ } . 

12.1.1 THEOREM. Let X = {Xi,...,Xd) be a random, vector in 
with independent components, all distributed with density fa- If 

1 < a < p < 2, then 

limf-" log P{X{Mp)^t} ^ 



t— >oo 



inf^gM^ VarX(m) 



Proof. For x in W^, let I{x) = Yli<i^d^'^Sf(^i)- some constant 
c, 

I{x) =0 \xi\" + c. 

This is a strictly convex function since a > 1. It satisfies (9.2.1). 
Moreover, Lemma 9.1.9 gives 

^•(^) =C+I^la/Vl)- 
Consequently, as t tends to infinity, 

I,iMp/t) ~ i-"/sup { J2 -.xeMp 
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If m belongs to , then each component \mi\ is less than or equal 
to 1. Then, the inequality (/? — > (a — 1)/q: gives 

with equality if and only if m has exactly one coordinate equal to 1 
or —1. Thus Proposition 9.2.1 yields 

lim log P{ X{M) ^ t } = -1 . 

t— >oo 

Next the variance of X{m) is \m\^. But if m belongs to Mg, the 
inequality |mp ^ 1 holds, with equality if and only if one of the 
components of m is 1 or —1, and all the others are zero. This concludes 
the proof. ■ 

The proof we gave is reasonably short, but a bit mysterious. For 
d = 2, the following picture makes the result obvious if one keeps in 
mind Laplace's method. It represents the upper right quadrant of the 
plane, and the various sets involved, for a = 1.2 and (3 = 1.5. The set 
on which the integration is performed is shaded gray. Its boundary 
is the polar reciprocal of M^. The set Af^ is the black line inside 
the shaded area. The Euclidean unit sphere is the white line inside 
the shaded area. The last black line is the level set of the density 

fa{x)fa{y)- 




1 



Notice that for d = 2 we have a process indexed by the one 
dimensional set M^. For d > 2 we have a random field. More 
importantly. Theorem 12.1.1 may be a prototype for a misleading 
statement! The proof shows that the left hand side of the statement 
has no intrinsic connection with the right hand side! The proof shows 
that indeed the variance appears by a pure coincidence. In my opinion, 
the Gaussian case is not any different, as Proposition 9.2.1 shows. 
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12.2. Asymptotic expansions for the tail of the supremum of 
Gaussian processes can be arbitrjirily bad. 

For many reasons, both theoretical and applied, there has been a 

large literature devoted to the approximation of the tail probability 
of the supremum of Gaussian processes. In view of (12.1.1), it is quite 
natural to search for an approximation of the form 



P{X{M) ^ t} = Texp (2^2?^)) {Pn{llt)+o{t-^)) as t ^ 00 



where P„ is a polynomial of degree n. The hope of course is that for 
moderate t's, this expansion provides an accurate approximation. 

The aim of this section is to provide an example where such 
expansion holds, but, no matter what, provides a poor approximation 
for fixed t. It is essential to remember that an asymptotic expansion 
like (11.2.1), if it exists, is unique — see, e.g., Olver, 1974, §1.7. In 
particular, it does not depend at all on which method is used to derive 
it. Thus, the failure we want to describe is not that of a particular 
method. The one proposed in these notes as well as any other fails, 
and there is no way arround if one sticks to approximations of the form 
(12.2.1). The basic idea in this section is to mimic what happened for 
autoregressive models. Our approximation was not so good when we 
appealed to Theorem 8.3.1, because it was actually quite likely that 
the largest random variable was not an ej for which Cj.j = 0; even if 
one conditions by the appearance of a large deviation, it is quite likely 
that it is caused by a large for which Cj^j = 0. 

To build our example we will first give it in a geometric form. We 
will discuss afterwards some of its features. 

Consider the convext set in R*^, 



Denote by M the polar reciprocal of dC. Let X be a random 
vector in M*^, having a centered normal distribution, with independent 
components. Then 



(12.2.1) 



C = {p: (fJ,ei) ^l}f|{p : |p|^l + e}. 



X{M) ^ t if and only iiX^tC . 



Define the polynomials 
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12.2.1 PROPOSITION. The tail expansion 

P{ X{M) ^t} = iPn{l/t) + o(r")) ast^oo 



holds. However, for any t ^ 0, 



P{X{M) ^t}^ 2v^e-*'(^+(^'/2))i'^-(V2) 

Pn{j-/t) 



Before proving this result, let us see why this provides the proper 
example. The first statment lets us hope that e~* /'^Pn{'i./t)/y/2Trt is 
a good approximation of the tail probability of the supremum. The 
second statment asserts that it is not the case if e is small and t is large, 
but not too large. The following plot shows the lower bound for d = 2, 
n = 2 and various values of e, as a function of the approximation. 
For instance, if e = 1, the lower bound is less than 1, which means 
that the approximation may underestimate. A more interesting value 
is for e = 0.5; when the approximation is about 10^^'^ ~ 3%, the 
lower bound is about 2. Thus, the approximation underestimates the 
correct probability by a factor at least 2. One should keep in mind 
that for a typical statistical application, we are interested in t's such 
that P{X{M) ^t}is between 10"^ and IQ-^. 




We will comment further on this example after we prove our 
statement. 
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Proof of Proposition 12.2.1. We first start with the obvious bound 

P{ X{M) -^t) ^ P{Xi^t ov\X\^ t{l + e) } 



^P{X^^t} + P{\X\^ t{l + e) } . ^^^-^-^^ 
The crude logarithmic estimate of Proposition 2.1 shows that 
P{ \X\ ^ t(l + e) } = 0(e-*'(^+^)'/2i2<i) as t ^ oo . 

On the other hand, the standard asymptotic expansion for the com- 
plementary error function (see, e.g., Olver, 1974, §3.1.1) yields 

-t^ /2 

P{Xi^t} = ^^(F„(l/t) + o(r")) asi^oo. 

V 27rt 



Thus (12.2.1) provides the asymptotic expansion in the first assertion 
of Proposition 12.2.1. 

We now use another lower bound, namely 

P{X{M)^t} ^ P{\X\^t{l + e)}, 

which follows from the equality in (12.2.1). Since \ X\'^ has a chi-squarc 
distribution with d degrees of freedom, an integration by parts yields 

f-oo |_i a;/2 

P{\X\^t{l + e)}= ^W^TTTf^a; 



t2(i+,)2 2<^/2r(c//2) 
^2t'^-^(l + e)''-^e-*'(i+^)V2. 

For any it ^ 0, the bound Pn{u) ^ 1 holds; this comes from the 
fact that the asymptotic expansion for the error function is obtained 
by integrating by parts, and the integrations lead to an alternating 
series — see, e.g., Olver, 1974, §3.1. Thus the second statement of 
Proposition 12.2.1 follows. ■ 

When d = 2, we can make a very explicit construction of the 
process. The polar reciprocal M is just a piece of circle and a point, 

M={(x,y)GR2 : x"" + = {1 + e)'^ ■,x ^ {1 + e)'^ } U {(1,0)}. 



This can be seen by a formal proof, but it is obvious from the following 
picture. The domain on which we integrate is shaded. The unit 
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sphere, or equivalently the level set of the Gaussian measure, is the 
dark sphere. The set M is an open arc, in black as well, with the 
point ei marked. 




We choose M to be reduced — see definition in section 9.1. We 
could as well index the process by a larger set, such as 

(l + e)-i5iU {(1,0)} 



or even 



(l + 6)-i5iU [(l + 6)-iei,ei]. 
This last set can be parameterized as follows. Let 

( ( cos(27rt), sin(27rt)) if ^ t ^ 1 

The corresponding Gaussian process is 

{Xi cos(27rt) + X2 sin(27rt)) if ^ t ^ 1 



X{t) 



Its variance is 



et + l-e V 
1 + e ^1 



if 1 ^ t ^ 2. 



VarX(i) 



if ^ t ^ 1 



Now one can argue that our example is specific; the maximal variance 
is achieved when t = 2, that is on the boundary of the domain. Well, 
we can always define 

- / ^(^) if ^ t ^ 2 
~ \x{A-t) if 2 ^ t ^ 4. 
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This process has maximum variance at t = 2. And certainly, one could 
argue that CoY{X{t), X{s)) = 1 if 1 ^ s,t ^ 2, and thus this process 
is pathological. This argument can be also ruled out by perturbing 
each coordinate Y{t) by some very tiny multiple of a brownian bridge. 

The moral of the story is that the asymptotic expansion should not 
be worked out blindly. One should certainly make a careful study of 
the covariance of the process and be very cautious when the variance 
does not vary much. Notice that the ratio of the maximal variance of 
the process to the minimal one is (1 + e)^. For e = 0.3, this is 1.69, 
which is not that small. Going back to the lower bound in Proposition 
12.2.1, notice that the polynomial term in the lower bound is of order 
^d-(i/2) ^ ^j^g dimension d increases, the asymptotic expansion gives 
a worse approximation. It can be arbitrarily bad by just taking d 
large enough. Therefore, the constancy of the variance should be 
measured with respect to the dimension. It is therefore very unclear 
what happens in large dimensions or even in infinite dimensions. It 
is also unclear how to assess a priori the approximating quality of an 
asymptotic expansion in this context. 

12.3. Maximum of nonindependent Gaussian random vari- 
ables. 

When one wants to simulate numerically a Gaussian process, there is 
not much choice other than to discretize it. To what extent can we 
obtain an approximation of the distribution of the original process by 
that of the corresponding discretization? There is no claim that this 
section brings some new result. The one we are going to prove now can 
be derived from others existing in the literature. But its derivation 
may be of pedagogical interest. 

Having in mind a discretized process, let X be a Gaussian vector 
in W^, with mean and definite positive covariance matrix S. Let 
o"^ = maxi^j^rf Sj^j be a largest diagonal element. 

12.3.1. THEOREM. Let X have a Gaussian distribution, centered, 
with positive definite covariance matrix S . Then 

P{ma^Xi^t}^ ti{i : ^i,i = a^}. 

Proof. We apply Theorem 7.1. Define 

At = tAi =t{x eR'^ : max ^ 1 } . 
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Set 

I(x) = -x'^^-^X. 
This is a convex function, homogenous of degree a = 2, and 

.-SS^- ^ " = (2.)^/^(detS)V. I, 

To apply Theorem 7.1, we need to minimize / over A\. Since S 

is symmetric, we can diagonahze it and write S = QDC^ with D 
diagonal and Q orthogonal. The change of variable x = Q^j/ shows 
that ^ 

/(Ai) = inf -I —x^Yr^x : max Xi ^ 1 > 



inf I ly^Dy : max (y, g'^ei) ^ 1 1 . 



Writing the Lagrangian to optimize y^D ^y subject to the constraint 
(y, Q'^ei) = 1 and optimizing over i, we obtain 

. N 1 • 1 1 1 

HAi) = — mm -jp = = — - . 

2 i^i^d el Dei 2 maxi^j^^ Sj^j 2(7^ 

Moreover, /(^i) is achieved for the points y = DQ'^ei/a^, or equiva- 
lently x = llei/a^. Thus, the dominating manifold for Ai is 

T>A^ = I — ^ : i such that Sj^j = | . 

It is of dimension k = 0. Since D7 = Theorem 7.1 yields 

P{ max ^ t } = > -— 

i^Kd ^^/2^(det S) V2 . ^^^^ (^^^ G^, (Se^/a^)) 

as t tends to infinity. We need to calculate Gai ■ As mentioned after 
the statement of Theorem 7.1.1, it is obtained as the compression of 
the difference of two second fundamental forms. The one for dAi 

vanishes since dAi is locally a flat hyperplane. That for the level set 
of / is D^//|D/|. At Sei/c7^, its value is c7^E~^. The tangent space 
at dAi at this point is 

{BI{J:e^/a')}^ = {e^/aY = ei. 

Thus, GAiC^Gi/a^) is the expression of a^E~^ to ej-. Therefore 

detGAi(Sei/a^) = a^('^~^)det (S~^efe, e^) i<fc,i^d . 
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Thus, it is (T^('^~^) times the determinant of the (i, z)-cofactor of T,"^, 
which is Ej^jdetS"^. Consequently, 



(det S) V2det (Sei/a2)V2 
and this gives the result putting all the estimates together. ■ 

The fact we now want to stress is about discretizing Gaussian 
processes to simulate the distribution of their maximum. Since the 
maximum of the discretization is less than the maximum of the original 
process, this can only give a lower bound. On the far tail. Theorem 
12.3.1 asserts that this lower bound must be of order ae"* /^'^ /t\/27r. 
This implies two things. First, one should include the points of largest 
variance in the discretized sequence. This is almost common sense. 
Second, in theory, the far tail will be well approximated only if that 
of the original process behaves like cre~* /^'^ /t\/27r. These processes 
have been characterized by Talagrand 



12.4. The truncated Brownian bridge. 

The Brownian bridge B on [0, 1] is a centered Gaussian process with 
covariance 

F.B{s)B{t) = st - s M . 
It is a classical result that it admits the Karhunen-Loeve expansion 

, a/2 v-^ sin(A;7rs) 

where the Xfe's are independent, normally distributed random vari- 
ables. It is known — see, e.g., Billingsley, 1968, §11 — that 

P{ sup ^ t} = . (12.4.1) 

The aim of this section is to obtain an approximation for the tail of 
the supremum of the truncated series 

l<k<d 



This example is quite interesting, because we will see that Theorem 
5.1 — or equivalently. Theorem 7.1 — does not apply. However, a 
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slight change in the arguments will allow us to obtain the desired 
asymptotic equivalence. 

Let us first explain why Theorem 7.1 does not apply, and, in 
particular, why assumption (7.5) — or assumption (5.3) if one uses 
Theorem 5.1 — is not satisfied. Recall that ei,...,ed denotes the 
canonical basis in W^. Define the curve 

, , \/2 sinffevrs) 

in W^. Introducing the Gaussian vector Y = {Yi, . . . ,Yd,), we have 
Ba{s) = {Y,p{s)). Define 

At = {yeR'^: sup {y,p{s)) ^t}^tA^. 

Let I{y) = |y|V2. Then 

P{ sup Ba{s)^t} = [ e-^(^) dy . 

o^s^i (27r)'*/^ J At 

To find the dominating manifold. Proposition 9.1.7 combined with 
Lemma 9.1.9 suggest that we should search for the points s maximizing 
the variance of Bd{s), 

ir n ^ \ I ^ m2 2 ^ sin^(A:7rs) 

Since YaxBd{s) = Var5d(l — s), it suffices to locate the maximum in 
[0, 1/2]. We differentiate the variance, obtaining 



d ^ , ^ / X 2 v-^ 2 sin(fc7rs) cos(fc7rs) 
-VarB,(.) = -, J: 

2 sin(2/c7rs) 



7^2 E 

l^k^d 

It follows from Jackson's (1912) theorem — see, e.g., Andrews, Askey 
and Roy, 1999, chapter 7 — that {d/ds)YsixBd{s) is positive on 
(0,1/2). Thus Bd{s) has a unique point of maximal variance for 
s = 1/2, no matter what d is. The maximal value depends on d. 
It is 

ai = Vari?,(l/2) = A J] 1. 

l^k^d 
k odd 
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Since the variance is maximum at a unique point, Proposition 9.1.7 
suggests that the dominating manifold in our problem is a single point 
p* = p(l/2)/|p(l/2)|2. Its dimension is A; = 0. 

We then calculate I{Ai) = \p*\'^/2 = l/{2aj). The level surface 
^i(Ai) is the sphere of radius l/cr^, centered at the origin. Its second 
fundamental form is a^ildd^i at every point. 

Let us now calculate the second fundamental form of dAi. To 
parameterize dAi, we follow the construction in section 9.1, with 
the simplification given in section 9.2 for the special case of a radial 
function /(•). The vector r = p' /\p'\ is a unit tangent vector to the 
curve M = p([0, 1 ]). As defined in section 9.1, let be a unit normal 
vector to M in {TpM + pR) Q TpM, that is 



Notice that being maximal for s = 1/2, the vectors p(l/2) and 

r(l/2) are orthogonal, and 1^(1/2) equals p*\p{l/2)\. 

Let Xi, . . . , Xd_i be an orthonormal moving frame in M'^ {TpK + 
pM). Using the simplification pointed out after (9.2.4), 

defines a local parameterization of dAi, or equivalently of OCm, near 
p* when s is chosen close to 1/2 and («2, . . . ,tid-i) close to 0. To 
calculate the second fundamental form of dAi, recall that p{s) is an 
outward normal to dAi at X{s, U2,. ■ ■ , Ud-i) thanks to Lemma 9.1.6. 
In particular, 

TxOCm = {p}^ = span{ X2,..., Xa-i } © (span(p, r) pR) . 

We complete X2, . . . , X(i_i into an orthonormal basis of TxOCm by 
adding the vector field 

^_ Projp±T _ 1 tIpI"^ - {t,p)p 



|Projpxr| \p\ ^|p|2 - (r,p)2 



We have 



dp-Xj = for J = 2, . . . , d " 1 , 

expressing the fact that dAi is a ruled surface with flat generators in 
the space spanned by X2,. ■ ■ ,X(i-i. Thus, the second fundamental 
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form of dAi vanishes on X2,. 
e,X2, . . .,Xii^i is then 

/ {dp ■ e, e) 



,Xd-i. Its matrix in the basis 











V 







0/ 



It remains to calculate {dp ■ e, e) on the dominating manifold p* = 
X{l/2,0, ... .0). The vectors p{l/2) and r(l/2) being orthogonal, 
e(l/2, 0, . . . , 0) and r(l/2) are equal. Since r is a unit tangent vector 

to the curve p([0, 1 ]), we have 

(dp(l/2) • e(l/2, . . . , 0), e(l/2, 0, . . . , 0)) = {dp ■ r, r) (1/2) 



|r(l/2)P 



1. 



In conclusion, on the dominating manifold M, and in the basis 
(e, X2, . . . , Xa-i), the fundamental form of dAi an 

\ 

and Ha^^^^j = 



n, 











/ 





are 


a 








a 



In particular, Ilg^^ — Ha^^^^j is diagonal. Its upper left entry vanishes. 
Because the dominating manifold "Dqcj^j is a point, HoAi — nAj(^^) = 
Gai', thus detG^j is null, and assumption (7.5) does not hold. 

The fact that the fundamental forms have the same upper left entry 
expresses the fact that along the tangent direction e(l/2, 0, . . . , 0), the 
surfaces ^i(Ai) and OCm pull apart very slowly. The following pictures 
illustrate this fact when d = 2 and d = 3. 

















/J 



d = 2 

(the little loop next to the origin is M = p([0, 1]).) 
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d = 3 

The boundary of Ai is a ruled surface. This picture shows the ruled surface 
obtained from the parameterization X{s, U2), with no constraints on U2- 
The actual set Ai is in the convex hull of the piece containing the origin. 
The curve drawn on the surface is X(s, 0) = i^(s)/(i^(s), p(s)). 



e-M e-M 




In this picture, we can see the sphere /\i(Ai), centered and of radius l/cra. 
The transparent surface is dAi, parametrized by X(s, 02), but with 02 
constrained to be negative. Its generators are straight lines, parameterized 
by U2, shown here for 02 negative. Thus, the surface dAi is cut along the 

curve X{s, 0). The projection of this curve on the sphere /^i(Ai) can be seen. 
The set dAi has a unique contact point, p* with the level set /^i{Ai)- But, 
as it can be seen along the curve X(s, 0), the boundary dAi pulls away very 
slowly from J\i(Ai) near p*. 

In the current situation, it is easy to obtain the desired equivalence 
for the tail of the supremum, because /(•) is homogeneous. If this tail 
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were behaved like that of the process at its point of maximal variance, 
we would have a tail equivalence in 

P{ Ba{l/2) ^t} = l- ^t/aa) ~ -i=^ exp ( - ^) . 



The tail behavior turns out to be much more surprising. Even and 
odd dimensions d yield different exponents in the polynomial term. 
The reason is that for even dimensions, the set OCm pulls away more 
slowly along the direction e(l/2, 0, . . . , 0). The contact between the 
two surfaces is of order 1 for odd dimensions, and of order 3 for even 
ones. 

12.4.1. THEOREM. As t tends to infinity, the ratio 
P{ sup Bd{s)^t}/ {l^^t/ad)) 

is equivalent to 

(i) \/d+ l/(Tii if d is odd; 

(it) ^r(l/4)(^p^)'^^^-^-2 ifd is even and d 7^ 1; 
(Hi) 1 if d= 1. 

REMARK. We will see that lima^^al = 1/4. Since d is 
increasing, we have cr^ < 1/4 for any integer d. Thus the constant 
in statement (i) of Theorem 12.4.1 is always larger than 1. This is in 
agreement with the failure of assumption 5.3. In the same spirit, the 
quantity involved in statement (ii) is also larger than 1 when t is large 
enough. 

Proof of Theorem 12.4.1. We first proof statement (iii). Since 
Bi{s) = — Xi sin(7rs) , 

TT 

its supremum is if Xi is negative, and \/2-^i/vr otherwise. The 
result follows. 

From now on, assume that d ^ 2. The change of variable y = tc 
allows to write 

P{ sup Ba{s)^t} = -\^J e-l^lV^dy 

f _+2u|2 
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Thus, to prove Theorem 12.4.1, we need to estimate the integral 



/ 

JA^ 



for large values of A. It is plain from standard results on Laplace's 
method, or from chapter 7, that we can restrict the integration to the 
intersection of Ai with an arbitrary neighborhood of the dominating 
manifold p*. Making use of the construction in section 9.1, we can 
write any point x of Ai near p* as 

x{s,U,v) = w + UjXj{s) + Vp(s) , V^O, 

where u = (it2, . . . , itd-i) is in a neighborhood of the origin. Be- 
cause . . . , Xd-i, V is an orthonormal basis and p is orthogonal to 

^2, • • • , Xd-i, 

{v,pY 

To calculate the Jacobian of the change of variable x ^ {s,u,v), is 
easy. Write 



dx , ^ dx 



dui ' dv ^ ' 



dx d ( v(s) \ sr^ d , , , 

Ts^d-s\ {.{slp{s)) )^ 22 u,-Xjis)+vp{s). 

Introducing w = p\p'\'^ — {p,p')p', we see that v/{v,p) = w/{w,p). 
Thus, 

w' {w',p) + {w,p') 



{u,p)J {w,p) {w,p)^ 
It is straightforward to calculate 

w' = 2p{p',p")-{p,p")p'-{p,p')p". 

Since |p(s)p is maximal at s = 1/2, the tangent vector p'{l/2) is 
orthogonal to p{l/2). The specific form of p(-) in this problem yields 
that for any j, the derivative d^p/ds^ at s = 1/2 involves only the 
vectors with k odd if j is, and k even otherwise. Consequently, 
p'{l/2) and p"(l/2) are orthogonal, and 

u;(l/2) =p|pf (1/2), w'il/2) = -{p,p")p'il/2) 

d ( fjs) \ ^ {P,P")P 



ds\{v{s),p{s)) 



(V2) = -j^(l/2). 
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Thus, in the orthonormal basis r, X2, . . . ,-^^^-1) the Jacobian matrix 
is 







V 







+ 0{\x—p*\) as X — > . 



1/ 



Now, let e be an arbitrary positive real number. In what follows, 
r] denotes a positive real number, which we will choose as small as 
needed. Denote by D{r]) the domain 

D{r]) = {x{s, u,v)eM.'^ : \s - l/2\ ^ rj , \u\ ^ ij , ^ v ^ rj} . 

Prom our evaluation of the Jacobian, if 77 is small enough 



Iai 



-A|x|V2d, 



nD(r?) 



^(1 + e) 



{p,p" 



W\p' 



^(1/2: 



exp 



D{v) 



2V(Ks),p(5))2 



+ |Mp +v'^\p{s)\ + 2v) dudvds. 



)) 



We first perform the integration in u, obtaining the upper bound 



+ v'^\p{s)\'^ + 2v] ]dsdv 



')) 



n+e)^^(l/2)M!:^x 
^ ^\pm^ ^ ' ;^(d-2)/2 X 

I\s-i/2\^ri lo^v^v^^^ ( 2 ((zy(s),_p(s))^ 
To perform the integration in f , we write 

e^p(^-^{v'\p{s)\' + 2v)) 

= {X{v\p{s)\' + 2))^'^ exp {^{v'\p{s)\' + 2v)) 

and integrate by parts. We obtain 

/ exp(-^{v^\p{s)\'^ + 2v))dvr^\ 
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as A tends to infinity. This yields the upper bound 

a-)«a/^)^/exp(-^M.).P(.)>id». 

To estimate this last integral boils down to using the classical Laplace 
method. We introduce 

We then have 

/ exp ( - ds 

We then need to obtain a Taylor expansion for near 1/2. To this 
end, for any integer m, we define 

Since 

, , ,,9 2 v-^ sin^(/c7rs) 1 v-^ 1 — cos(2A;7rs) 

we easily obtain the derivatives (d'^/ds™')(|p(s)p) at s = 1/2, for 
m = 0, 1, . . . , 4. Using Taylor's formula, we infer that 

|;.(l/2 + = |M1/2)|' + 2/i'5o - ^K^h^S-^ + 0{h'') . 
We also have 

, 2 ■r-^ sin(A;7rs) cos(A;7rs) 1 sin(2fc7rs) 

M{s) = - ^ = - ^ . 

Hence, a simple calculation of the derivatives at s = 1/2 and an 
application of Taylor's formula give 

{p,p') {1/2 + h) = 2hSo - U^h^S2 + 0{h^) . 
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Finally, 

\p\s)\^ = 2 ^ cos^{kTTs) = d+ ^ cos{2kTTs)^ 

from which we deduce \p'{l/2)\'^ = d + Sq, and 

\p'{l/2 + MP = b'(l/2)|' - 2Tr^h'S2 + 0{h^) . 
Equipped with these expressions, a little algebra gives 

6{l/2 + h) = 6{l/2) + 2So{l-j-^^y 



\p'{l/2W 

2 So_ 

3 b'(l/2)P 

The interesting fact is now that equals —1 if d is odd, and equals 
if d is even. Thus, if d is odd, 

2 

\p'{m\' 

while if d is even, 



5(1/2 + h)= 6(1/2) - 2(1 + T-TT^y + 0(h') 



5(1/2 + h) = 5(1/2) - + 0(h^) . 

Notice that when d is even, 5*2 is positive since 

(-i)^k^= E (2fc)^ - (2fe - 1)' = E (4^-1) 

= m(2m + 1) . 

Hence, S(s) is maximal at s = 1/2 whatever the parity of d is. 

To conclude the proof of Theorem 12.4.1, assume first that d is odd. 
Then, for 77 small enough, (12.4.2) is less than 

V2^(l + e)5(l/2) 1 



Vl + 2b'(l/2)|-2VA' 



All the arguments we used to obtain this upper bound can be used 
to obtain a lower bound, essentially by changing e to — e. Since e is 
arbitrary, combining all the estimates yields 



P{ sup Bd(s) ^t}^ J ^^^7 exp ( - 
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When d is even, for 77 small enough, the integral in (12.4.2) is less than 

r(i/4)/3y/4 / (TT^M 

AV4 \S2) V TT 

It then follows that 

. Vd Til/A)/ 6 \V4 1 / ^2 

This concludes the proof of Theorem 12.4.1. ■ 

To conclude this section let us make a remark on a^. Recall that 
X;fc>i(2A; + 1)"^ = vr^/S. Consequently, lim^ 

— >oofrf — 2, which is 

good, given (12.4.1). We can obtain a more precise estimate of cr^. 
Since 



L W+W ^ ^^W+W ^ Jrr 



dx 



{2x-l) 



2 ' 



and 



1"^ dx 
L (2a: + 1) 



;~ dx dx 



)^ -/m -/m (2a: 1)2 

we have 

I V- 1 1 /"^ 1 1 , 

I {2k + 1)2 4m 7^ (2x - 1)2 (2x + 1)2 

1 



2(4m2 - 1) 

Consequently, 



.2 1 1 
^'^"4"2^ 



1 



7r2(4rf2 - 1) 



Though the constant in the exponential term in Theorem 12.4.1, 
namely 1/(2(7^), has the right limit as d tends to infinity, we cannot 
take the limit of the polynomial term and recover (12.4.1). This is 
caused by the slow convergence of to the Brownian bridge. 

Again, the whole message is to be rather cautious with these 
approximations. Strange things may happen and further examination 
is certainly needed if they are to be used in serious applications. 
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12.5. Polar processes on boundary of convex sets. 

Let C be a bounded convex set in R'^, with nonempty interior. 

The purpose of this section is to construct a simple process on its 
boundary. This construction is suggested by Theorem 7.1 and the 
results of chapter 9. 

We first need to make some remarks on densities proportional to 
with I strictly convex and a-homogeneous. Recall that Ac denotes 
the level line /~^({c}). Every nonzero point x of can be written 
in a unique way as x = sA for some positive s and A in Ai. In this 
(A, s)-coordinate system, the measure e^^ can be rewritten as 



Conversely, such a measure corresponds to a log-concave and log-a- 
homogeneous measure on R*^. 

Going back to the convex set C given at the beginning of this 
section, assume that it also contains the origin. Let A be the polar 
reciprocal of dC. For A in A and nonnegative s, define /(sA) = s". 
Then, A = Ai for this specific function /. Equip R*^ with the log- 
concave density proportional to (13.5.1), and let X be a random vector 
having this density. We can consider the process p G dC i— (X, p) G 
R. We call this process a polar process on dC. 

The tail distribution of its supremum is given by the following 
result. 

12.5.1. THEOREM. For the polar process on dC defined above, and 



as t tends to infinity. 

Proof. We apply Theorem 7.1. In view of section 9.1, the dominat- 
ing manifold is A, of dimension k = d—1. Moreover, by construction, 
dAi = Ai. It follows from Theorem 7.1 that the tail equivalent is of 
the form 



To calculate D7, notice that it is the outward normal to A. Its norm 
is obtained through the identity 



s'^-ie-""|Proj^^AxA| dsdMA,{X) . 



(13.5.1) 



l/c = 4.e-^(-)d,x, 





^/(A.)|^^,=D7(A).A=A(,"/(A))| 



s=l 



aI{X) = a . 
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It implies 

|D/(A)|=a/|Proj(:r,A)^A|, 
and the result follows. ■ 

The correspondence between points p on dC and points A on the 
polar reciprocal allows us to write iProj^^^j^^^xAl = l/\p\- Thus the 
integral in Theorem 12.5.1 can also be rewritten as 

which is a nice formula. 

The following pictures are three simulations of a polar process on 
the ellipsoid + 2y'^ = 1 in R^, and a = 1. For the pictures on 
the left, at every point p of dC\ we draw a segment in the normal 
direction to TpdC, with length equal to the value of the process at 
p. The random point X is indicated by the star ^. On the right 
hand side pictures, the process is represented as a curve held over the 
ellipsoid, the height of the curve being the value of the process; these 
are orthographic projections. 
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Notes 

I do not think the results in this chapter are too serious! However, 
I beheve the message of caution that some carry is important. In a 
different vein, Feynman supposedly said that when you have a theory, 
you should show where it works and where it does not. That may be 
the point of section 12.2. 

Section 12.5 is just a way to generate pretty pictures of random 
caustics. I do not know any applications of this construction. 



Appendix 1. Gaussian and 

Student tails 



This appendix collects a few standard results related to the tails of 
the Gaussian and Student distributions. We write 

/ X^\-("+l)/2 

s^{x) = Ks,a[l + —) , a;eM,Q;>0 

for the Student density (with a + 1 degrees of freedom) , the constant 
Ks,a ensuring that Sa integrates to 1 over the real line. We denote by 



J — oo 

the Student cumulative distribution function. Its tail is given in the 
following result. 



A. 1.1. LEMMA. We have 

K a("~^)/^ / i \ 
l-S^{x) = ^^ + 0[^) asx^oo. 



Proof. Notice that as y tends to infinity, 

\ya+3j 

Consequently, as x tends to infinity, 

1-Saix) f°° dy /•~a{"+i)/2 



r dy /•oo aia+i;/^ / 1 X 



(q-1)/2 ^ 
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Let US now consider a Student-like cumulative distribution function 
Sa, i.e., such that 

Sa{~x) ~ 1 - Sa{x) ~ ■ , 

as X tends to infinity, and where the constant Kg ^^ can be any fixed 
positive number. We can obtain an asymptotic formula for the high 
quantiles. 

A.1.2. LEMMA. The following holds, 

(1 — Sa)'^{u) ^ — 71 as ^ . 



Proof. Let x tends to infinity and u tends to such that u = 
1 — Sa{x), that is a; = (1 — Sa)^{u). From the Student-like tail, we 
infer that 

u ~ — ■ , 

which is the result. ■ 

We can obtain similar results for the Gaussian distribution with 
cumulative distribution function 



) V^TT 



A.1.3. LEMMA. We have 



1 - <I>(x) = — =(l + o(l)) as x^oo. 



Proof. Integrate by parts to obtain 

v^(l - <^{x)) = I -ye-y^/^Ay 

Jx y 



' X 



/•~ e- 






2 '^y 


J X 


yZ 






3x3 





X 



X Sx^ Ay 



-dy. 
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Consequently, 



/27r Sx^'J V27r a:; 

and the result follows. ■ 

We can now obtain an asymptotic approximation for high quantiles. 
The second statement in the following lemma is instrumental in the 
sequel. 

A.1.4. LEMMA. We have 

21^2 log l/'u -^21ogl/'u 

as u tends to 0, and 

= 21ogl/it-loglogl/it-21og(2V7r)+o(l) as u ^ . 



Proof. As in the proof of Lemma A. 1.2, we start with the equality 
u = 1 — ^{x), that is a; = (1 — ^)^{u). We consider x tending to 
infinity, or equivalently u converging to 0. Lemma A. 1.3 implies 

log^z = — — — logx — log-\/27r+o(l) asx^oOjti^O. (A.1.1) 

Consequently, x = a/2 log l/it(l + a;i) with xi tending to as u tends 
to 0. Then (A.1.1) yields 

logM = (- log-)(l + - ^ log log - - log(2V7r) 

-log(l + a;i) + o(l) 

= log u — 2x\ log x\ log log log — 

u u 2 u 

- log(2V^) + o(l) . (A.1.2) 

We can then calculate 

loglogl/?x 
41ogl/u 
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with X2 tending to with u. But now (A. 1.2) imphes 

= ^^loglogl/u- — ^ (1+X2 

2 16 logl/tt 

- ^ log log - log(2v^) + o(l) 

= ^loglog--log(2V^) + o(l), 
and thus, as u tends to 0, 

a;2 = — ] — — (l + o(l)j . 

loglog 1/ti ^ ^ 
Consequently, gathering every piece yields 

as u tends to 0, which is the desired expression for $*~(1 — u). Square 
it to obtain that for $"^(1 — u)^. ■ 

We can now obtain an asymptotic expansion for o 5q. 

A.1.5. LEMMA. We have 

^ , . log log .X log(X.,»a"/^2v/?) 

$ oS'„(a;) = V2alogx- — == ,^ , 

2v2aloga; v2alog.x 



+ 



4^ 

V Vlog X 



Viogx/ 

as X tends to infinity. Consequently, 

$"-0 Se,{xf = 2a log a; - log log a; - 2log{Ks,aa''/^2^/^) + o(l) 
as X tends to infinity. 

Proof. From Lemma A. 1.4, we deduce that 

$^0 Sa{x f = -2 log (1 - Sa{x)) - log ( - log(l - 

-21og(20F) + o(l) 
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as X tends to infinity. But Lemma A. 1.1 implies 

- log (1 - Saix)) = alogx - log(K,,„a("-i)/2) + ^(l) 

as X tends to infinity. Consequently, 

$""0 Saix)"^ = 2Q;loga; - 2log{Ks,a(x^°'~^^^'^) - loglogx - loga 

-21og(2^^F) + o(l) 

as X tends to infinity, which is the second expansion in Lemma A. 1.5. 
Taking the square root yields the first assertion since 

^'^oSaix) 

/— / log log X log(X,,„a«/220F) / 1 XX 1/2 

= V2aloga;( 1- — + o[- j j 

\ Za log x a log X \ log .x / / 

= ^2^(1 - '-^^ - l'^g(y ^^20F) ^ / I XX _ 
\ 4aloga; 2aloga; \logx// 



We can also obtain an estimate of S^o ^[x). 
A.1.6. LEMMA. We have 

X^ 1 1 o-l 

log $(x) = — + - logx + - log{Ks,aa~V2Tr) + o(l) 
2a a a 

as X tends to infinity. 

Proof. Combine Lemma A. 1.2 and A. 1.3 to obtain 

Sro$(x) = (l-5„)^o(l -$)(:,) 

as X tends to infinity. The result follows by taking the logarithm. ■ 

In the case of an exact Student distribution, the term o(l) in 
Lemma A.1.6 is actually 0{x~^/°'e~^ /"^). In this special case, the 
approximation has a terrific accuracy as x tends to infinity! 



Appendix 2. 
Exponential map 



The purpose of this appendix is to state and prove the following 
proposition. It gives a bound on the error committed by linearizing 
the exponential map over the level set of a function. 

A.2.1. PROPOSITION. Let I :R'^ ^ R be a smooth smooth function, 
and let c be a regular value of I. Then Ac = I~^{c) is a smooth 
manifold ofM!^. Let 

Then, for any p in Ac, any u in TpAc with \u\ < 1/4M, 
I exp (n) — p — u\ ^ M|up . 



Proof. Let p be a point in Ac and v a unit tangent vector to Ac 
at p. We denote by 7(5) = cxpp{sv) the geodesic starting at p in the 
direction u on Ac- This parameterization is by arc length. Recall that 
A'' = D//|D/| is an outward unit normal vector field to the level set 
of /. Since 

dN{p)= (id -^NN^)^^{p), 
we have ||dA^(p)|| ^ M. Consequently, 

|A^(7(s)) -N{p)\ = I f dN{^{r)) ■^'{r)dr\ ^ sM . 
Jo 

The geodesic 7(-) is characterized by the parallel transport of its 
tangent vectors along 7(5) which can be written as 

[Id - NN^{j{s))]^"{s) = Proj:r,(.)Ae/(s) = 0. 

Any w in TpAc is orthogonal to M{p). Consequently, 

{w,^"{s)) = {w, [NN^^i^is]) - NN'^ip)] . y'is)) . 
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By duality, 

|ProjT,A.7"(s)| ^ \\NN^{j{s))-NN^{p)\\ 

^2\N{j{s))^NN^{p)\\^"{s)\ 
^ 2sM\^"{s)\ . 

Moreover, since {N(j{s)) ,7'(s)) = and 7 is parametrized by arc 
length, 

\{N{j{s)) ,j"{s))\ = I - {dN{j{s))-j'{s),j'{s))\ ^ M. 
It follows that 

|Proj(T,A.)x7"(^)l = KA^(p),7"(^))l 

^ \N{p)~N{j{s))\W\s)\+M 
^ sM\-f"{s)\ + M . 

Consequently, we have the inequality 

1/(5)1^ = |Proj^^^^7"(«)l' + |Proj(T,A.)x7"(«)|' 
^ 4s2m2|7"(s)|2 + M\s\y'is)\ + if . 

On the range s ^ 1/ (4M) , this inequality implies 
that is 

|7"(s)p ^ 16MV5 ^ 4M^ 
Consequently, since 7'(0) = v, 

|7'(s) -^;| = I / j"{t)dt\ ^ 2Ms. 
Jo 

The result follows, since 

\-f{s)-p-sv\ = \ {-f'{t)-v)dt\ ^ 2Mtdt = Ms^ = Mlsvl"^ . m 
Jo Jo 
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Notation 



This index has 3 parts. The first one concerns general notation, that is 
used throughout the book, eventually with variations in the arguments in each 
example. 

The second part contains the notation introduced in chapters 2-5. This is used 
almost throughout the book, but in examples treated in chapters 6-12 is given a 
specialized meaning. 

The third part contains the notation with the concrete meaning given through 
chapters 6-12. 

Some notation that are used very locally does not appear in the index. 

General notation. 



tt the cardinal of a set, as in jj{ 1, 2, 3 } = 3. 

d integration element, as in dt, dMM, dx. 

d dimension of the underlying space, R''. 

D, D^, . . . Gradient, Hessian, and higher order differentials. 

expp(-) exponential map at p (on a Riemannian manifold). 

Mm Riemannian measure of a manifold M. 

orthocomplement. 

injjj^(p) radius of injectivity of p in the manifold M. 

KM{x,y) sectional curvature of the manifold M along the tangent 

vector fields x and y. 

Amin(M), Amax(M) Smallest and largest eigenvalue of a matrix M. 

dA boundary of a set. 

df{-)/du partial differentiation of a function. 

IIm.p second fundamental form of the manifold M at p. 

R, R'* set of real numbers, the Euclidean d dimensional space. 

Rice Ricci tensor of the level linos of /. 

S„ unit sphere centered at the origin, of dimension n, that is 

the boundary of the unit ball in R"+^. 

Sv{x, r) ball centered at x, of radius r, in the vector space V . 

TpM tangent space of the manifold M at p. 

NpM normal space of the manifold M at p. 

\S\ Lebesgue measure of the set S. 



a;„ = 7r"/r((n/2) + 1) volume of the unit ball of R". 
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Notation from chapters 2-5. 



A, 1 


Tc, 19 


N{x), 26 


Alm^ 53 


/(•), 19 


WA, 37 


Ap{t,v), 38 


7(^), 19 


TTT 44 


ca,m, 54 


J-KAi-), 38 


V'(a::,s) = 26 


CA,M, 54 




^t*i-), 28 


Xa(p), 33 




io,M(p),56 


Xa(p), 33 


Ac, 25 


rA(p), 31 


Va, 36 


L(c), 19 




Ga(p) ,54 


Ma(-B), 62 




Notation from chapters 6-12. 






Chapter 6 




Kt , p.70 


/p(-) , P-71 


n(-) , p.75 


, p.71 


Qp , P-71 


r , p.75 




Chapter 7 




Ai, 79 


/(•) , 79 


Va, , 80 



Chapter 8 



§1 71, 100 Pt, 116 

C, 89 X»B(, 100 rx,t{m,v), 116 

ylt, 89 T, 108 r>(S,t), 116 

Wc{-), 89 T, 108 116 

/(•), 89 Vx, 108 rx,t{m), 116 

i?, 90 Ar(C), 108 p(m), 116 

§2 J(C), 108 ec, 127 

if.,,., 93 At, 109 Mc, 128 

At, 93 Mi, 109 Afo(C), 137 

Ji, 93 G(m), 109 Jo(C), 137 

*(•), 94 Afx, 111 Me,i, 137 

5<,(-), 94 Q(t), 111 Zx{C), 138 

Bt, 94 px,t(m,i)), 112 S'i(C), 139 

/(•), 95 qx,t{m,v), 112 §3 

95 R{t), 113 J*(C), 144 

P,,jAv), 95 71, 113 Mi, 144 

q{v), 96 7, 113 71, 144 

gej,tW, 96 ©fl.,116 7,145 



Notations 



313 



/(•), 145 A[, 150 pt, 157 

R{t), 145 B't, 150 qi,t{m), 157 
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Post face 



This book is the first of a larger project that I may try to complete. 
A second volume should be devoted to the asymptotic analysis of 
multivariate integrals over small wedges and their applications. A 
third one should extend some of the results of the first two volumes 
to the infinite dimensional setting, where there are some potentially 
amazing applications in the study of stochastic processes. 
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